This study is a result of a collaboration project between two groups, one from Brno University of Technology and the other from Georgia Institute of Technology (GT). Recently the Brno recognizer is known to outperform many state-of-the-art systems on phone recognition, while the GT knowledge-based lattice rescoring module has been shown to improve system performance on a number of speech recognition tasks. We believe a combination of the two system results in high-accuracy phone recognition. To integrate the two very different modules, we modify Brno's phone recognizer into a phone lattice hypothesizer to produce high-quality phone lattices, and feed them directly into the knowledge-based module to rescore the lattices. We test the combined system on the TIMIT continuous phone recognition task without retraining the individual subsystems, and we observe that the phone error rate was effectively reduced to 19.78% from 24.41% produced by the Brno phone recognizer. To the best of the authors' knowledge this result represents the lowest ever error rate reported on the TIMIT continuous phone recognition task.

S. M. SINISCALCHI, P. SCHWARZ, C.-H. LEE (2007). High-accuracy phone recognition by combining high-performance lattice generation and knowledge based rescoring. In IEEE ICASSP (pp. 869-872). IEEE Computer Society [10.1109/ICASSP.2007.367208].

High-accuracy phone recognition by combining high-performance lattice generation and knowledge based rescoring

S. M. SINISCALCHI;
2007-01-01

Abstract

This study is a result of a collaboration project between two groups, one from Brno University of Technology and the other from Georgia Institute of Technology (GT). Recently the Brno recognizer is known to outperform many state-of-the-art systems on phone recognition, while the GT knowledge-based lattice rescoring module has been shown to improve system performance on a number of speech recognition tasks. We believe a combination of the two system results in high-accuracy phone recognition. To integrate the two very different modules, we modify Brno's phone recognizer into a phone lattice hypothesizer to produce high-quality phone lattices, and feed them directly into the knowledge-based module to rescore the lattices. We test the combined system on the TIMIT continuous phone recognition task without retraining the individual subsystems, and we observe that the phone error rate was effectively reduced to 19.78% from 24.41% produced by the Brno phone recognizer. To the best of the authors' knowledge this result represents the lowest ever error rate reported on the TIMIT continuous phone recognition task.
2007
Settore IINF-05/A - Sistemi di elaborazione delle informazioni
1-4244-0727-3
S. M. SINISCALCHI, P. SCHWARZ, C.-H. LEE (2007). High-accuracy phone recognition by combining high-performance lattice generation and knowledge based rescoring. In IEEE ICASSP (pp. 869-872). IEEE Computer Society [10.1109/ICASSP.2007.367208].
File in questo prodotto:
File Dimensione Formato  
ICASSP_2007_BUT.pdf

Solo gestori archvio

Tipologia: Versione Editoriale
Dimensione 5.15 MB
Formato Adobe PDF
5.15 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/670049
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 22
  • ???jsp.display-item.citation.isi??? 14
social impact