We propose a novel universal acoustic characterization approach to spoken language recognition (LRE). The key idea is to describe any spoken language with a common set of fundamental units that can be defined "universally" across all spoken languages. In this study, speech attributes, such as manner and place of articulation, are chosen to form this unit inventory and used to build a set of language-universal attribute models with data-driven modeling techniques. The vector space modeling approach to LRE is adopted, where a spoken utterance is first decoded into a sequence of attributes independently of its language. Then, a feature vector is generated by using co-occurrence statistics of manner or place units, and the final LRE decision is implemented with a vector space language classifier. Several architectural configurations will be studied, and it will be shown that best performance is attained using a maximal figure-of-merit language classifier. Experimental evidence not only demonstrates the feasibility of the proposed techniques, but it also shows that the proposed technique attains comparable performance to standard approaches on the LRE tasks investigated in this work when the same experimental conditions are adopted.

SINISCALCHI, S.M., Reed J., Svendsen T., Lee C. H. (2013). Universal attribute characterization of spoken languages for automatic spoken language recognition. COMPUTER SPEECH AND LANGUAGE, 27(1), 209-227 [10.1016/j.csl.2012.05.001].

Universal attribute characterization of spoken languages for automatic spoken language recognition

SINISCALCHI, SABATO MARCO
Primo
Investigation
;
2013-01-01

Abstract

We propose a novel universal acoustic characterization approach to spoken language recognition (LRE). The key idea is to describe any spoken language with a common set of fundamental units that can be defined "universally" across all spoken languages. In this study, speech attributes, such as manner and place of articulation, are chosen to form this unit inventory and used to build a set of language-universal attribute models with data-driven modeling techniques. The vector space modeling approach to LRE is adopted, where a spoken utterance is first decoded into a sequence of attributes independently of its language. Then, a feature vector is generated by using co-occurrence statistics of manner or place units, and the final LRE decision is implemented with a vector space language classifier. Several architectural configurations will be studied, and it will be shown that best performance is attained using a maximal figure-of-merit language classifier. Experimental evidence not only demonstrates the feasibility of the proposed techniques, but it also shows that the proposed technique attains comparable performance to standard approaches on the LRE tasks investigated in this work when the same experimental conditions are adopted.
2013
SINISCALCHI, S.M., Reed J., Svendsen T., Lee C. H. (2013). Universal attribute characterization of spoken languages for automatic spoken language recognition. COMPUTER SPEECH AND LANGUAGE, 27(1), 209-227 [10.1016/j.csl.2012.05.001].
File in questo prodotto:
File Dimensione Formato  
S0885230812000307_.pdf

Solo gestori archvio

Tipologia: Versione Editoriale
Dimensione 367.02 kB
Formato Adobe PDF
367.02 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/649534
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 53
  • ???jsp.display-item.citation.isi??? 39
social impact