A bottom-up, stepwise, knowledge integration framework is proposed to realize detection-based, large vocabulary continuous speech recognition (LVCSR) with a weighted finite state machine (WFSM). The WFSM framework offers a flexible architecture for different types of knowledge network compositions, each of them can be built and optimized independently. Speech attribute detectors are used as an intermediate block to obtain phoneme posterior probabilities over which a phoneme recognition network is designed. Lexical access and syntax knowledge integration over this phoneme network are then performed to deliver the decoded sentences. Experimental evidence illustrates that the proposed system outperforms several hybrid HMM/ANN systems with different configurations on the Wall Street Journal task while it is competitive with conventional LVCSR technology.

S. M. SINISCALCHI, T. SVENDSEN, AND C.-H. LEE (2011). A bottom-up stepwise knowledge-integration approach to large vocabulary continuous speech recognition using weighted finite state machines. In IEEE INTERSPEECH 2011 (pp. 901-904). ISCA-INT SPEECH COMMUNICATION ASSOC, [10.21437/Interspeech.2011-351].

A bottom-up stepwise knowledge-integration approach to large vocabulary continuous speech recognition using weighted finite state machines

S. M. SINISCALCHI
Primo
Investigation
;
2011-01-01

Abstract

A bottom-up, stepwise, knowledge integration framework is proposed to realize detection-based, large vocabulary continuous speech recognition (LVCSR) with a weighted finite state machine (WFSM). The WFSM framework offers a flexible architecture for different types of knowledge network compositions, each of them can be built and optimized independently. Speech attribute detectors are used as an intermediate block to obtain phoneme posterior probabilities over which a phoneme recognition network is designed. Lexical access and syntax knowledge integration over this phoneme network are then performed to deliver the decoded sentences. Experimental evidence illustrates that the proposed system outperforms several hybrid HMM/ANN systems with different configurations on the Wall Street Journal task while it is competitive with conventional LVCSR technology.
2011
Settore IINF-05/A - Sistemi di elaborazione delle informazioni
978-1-61567-692-7
S. M. SINISCALCHI, T. SVENDSEN, AND C.-H. LEE (2011). A bottom-up stepwise knowledge-integration approach to large vocabulary continuous speech recognition using weighted finite state machines. In IEEE INTERSPEECH 2011 (pp. 901-904). ISCA-INT SPEECH COMMUNICATION ASSOC, [10.21437/Interspeech.2011-351].
File in questo prodotto:
File Dimensione Formato  
siniscalchi_interspeech2011.pdf

Solo gestori archvio

Descrizione: Il testo pieno dell’articolo è disponibile al seguente link: https://www.isca-archive.org/interspeech_2011/siniscalchi11_interspeech.html
Tipologia: Versione Editoriale
Dimensione 308.34 kB
Formato Adobe PDF
308.34 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/663735
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 16
  • ???jsp.display-item.citation.isi??? 2
social impact