We have recently proposed a universal acoustic characterisation to foreign accent recognition, in which any spoken foreign accent was described in terms of a common set of fundamental speech attributes. Although experimental evidence demonstrated the feasibility of our approach, we belive that speech attributes, namely manner and place of articulation, can be better modelled by a deep neural network. In this work, we propose the use of deep neural network trained on telephone bandwidth material from different languages to improve the proposed universal acoustic characterisation. We demonstrate that deeper neural architectures enhance the attribute classification accuracy. Furthermore, we show that improvements in attribute classification carry over to foreign accent recognition by producing a 21% relative improvement over previous baseline on spoken Finnish, and a 5.8% relative improvement on spoken English

Hautamaki, V., SINISCALCHI, S.M., Behravan, H., Salerno, V.M., Kukanov, I. (2015). Boosting universal speech attributes classification with deep neural network for foreign accent characterization. In INTERSPEECH 2015 (pp. 408-412). ISCA-INT SPEECH COMMUNICATION ASSOC [10.21437/Interspeech.2015-165].

Boosting universal speech attributes classification with deep neural network for foreign accent characterization

SINISCALCHI, SABATO MARCO;
2015-06-01

Abstract

We have recently proposed a universal acoustic characterisation to foreign accent recognition, in which any spoken foreign accent was described in terms of a common set of fundamental speech attributes. Although experimental evidence demonstrated the feasibility of our approach, we belive that speech attributes, namely manner and place of articulation, can be better modelled by a deep neural network. In this work, we propose the use of deep neural network trained on telephone bandwidth material from different languages to improve the proposed universal acoustic characterisation. We demonstrate that deeper neural architectures enhance the attribute classification accuracy. Furthermore, we show that improvements in attribute classification carry over to foreign accent recognition by producing a 21% relative improvement over previous baseline on spoken Finnish, and a 5.8% relative improvement on spoken English
giu-2015
978-1-5108-1790-6
Hautamaki, V., SINISCALCHI, S.M., Behravan, H., Salerno, V.M., Kukanov, I. (2015). Boosting universal speech attributes classification with deep neural network for foreign accent characterization. In INTERSPEECH 2015 (pp. 408-412). ISCA-INT SPEECH COMMUNICATION ASSOC [10.21437/Interspeech.2015-165].
File in questo prodotto:
File Dimensione Formato  
i15_0408.pdf

Solo gestori archvio

Descrizione: Il testo pieno dell’articolo è disponibile al seguente link: https://www.isca-archive.org/interspeech_2015/hautamaki15_interspeech.html#
Tipologia: Versione Editoriale
Dimensione 268.09 kB
Formato Adobe PDF
268.09 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/649577
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 21
  • ???jsp.display-item.citation.isi??? 11
social impact