In recent research, we have proposed a high-accuracy bottom-up detection-based paradigm for continuous phone speech recognition. The key component of our system was a bank of articulatory detectors each of which computes a score describing an activation level of the specified speech phonetic features that the current frame exhibits. In this work, we present our first attempt at designing a universal phone recognizer using the detection-based approach. We show that our technique is intrinsically language independent since reliable articulatory detectors can be designed for diverse languages, and robust detection can be performed across languages. Moreover, a universal set of detectors is designed by sharing the training material available for several diverse languages. We further demonstrate that our approach makes it possible to decode new target languages by neither retraining nor applying acoustic adaptation techniques. We report phone recognition performance that compares favorably with the best results known by the authors on the OGI Multi-language Telephone Speech corpus.

S. M. SINISCALCHI, T. SVENDSEN, AND C.-H. LEE (2008). Toward a detector-based universal phone recognizer. In ICASSP (pp. 4261-4264) [10.1109/ICASSP.2008.4518596].

Toward a detector-based universal phone recognizer

S. M. SINISCALCHI;
2008-01-01

Abstract

In recent research, we have proposed a high-accuracy bottom-up detection-based paradigm for continuous phone speech recognition. The key component of our system was a bank of articulatory detectors each of which computes a score describing an activation level of the specified speech phonetic features that the current frame exhibits. In this work, we present our first attempt at designing a universal phone recognizer using the detection-based approach. We show that our technique is intrinsically language independent since reliable articulatory detectors can be designed for diverse languages, and robust detection can be performed across languages. Moreover, a universal set of detectors is designed by sharing the training material available for several diverse languages. We further demonstrate that our approach makes it possible to decode new target languages by neither retraining nor applying acoustic adaptation techniques. We report phone recognition performance that compares favorably with the best results known by the authors on the OGI Multi-language Telephone Speech corpus.
2008
978-1-4244-1484-0
1-4244-1484-9
S. M. SINISCALCHI, T. SVENDSEN, AND C.-H. LEE (2008). Toward a detector-based universal phone recognizer. In ICASSP (pp. 4261-4264) [10.1109/ICASSP.2008.4518596].
File in questo prodotto:
File Dimensione Formato  
ICASSP2008_Siniscalchi.pdf

Solo gestori archvio

Tipologia: Versione Editoriale
Dimensione 649.07 kB
Formato Adobe PDF
649.07 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/649519
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 52
  • ???jsp.display-item.citation.isi??? 35
social impact