Institutional Research Archive of the Università degli Studi di Palermo

A bottom-up stepwise knowledge-integration approach to large vocabulary continuous speech recognition using weighted finite state machines

S. M. SINISCALCHI (first author; Investigation); T. SVENDSEN; C.-H. LEE

2011-01-01

Abstract

A bottom-up, stepwise knowledge-integration framework is proposed to realize detection-based large vocabulary continuous speech recognition (LVCSR) with a weighted finite state machine (WFSM). The WFSM framework offers a flexible architecture for composing different types of knowledge networks, each of which can be built and optimized independently. Speech attribute detectors serve as an intermediate block that yields phoneme posterior probabilities, over which a phoneme recognition network is designed. Lexical access and syntax knowledge are then integrated over this phoneme network to deliver the decoded sentences. Experimental results show that the proposed system outperforms several hybrid HMM/ANN systems with different configurations on the Wall Street Journal task while remaining competitive with conventional LVCSR technology.
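
The stepwise composition described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the `WFST` class, the `compose` and `best_path` helpers, the phonemes, the three-word lexicon, and all costs are hypothetical and chosen only for illustration. Costs stand in for negative log probabilities, and the composition assumes the right-hand machine has no epsilon input labels, which sidesteps the epsilon filtering a real toolkit would need.

```python
import heapq
from collections import defaultdict

EPS = "<eps>"  # empty label (hypothetical convention for this sketch)


class WFST:
    """Tiny weighted transducer over the tropical semiring (costs add, minimum wins)."""

    def __init__(self, start, finals):
        self.start = start
        self.finals = set(finals)
        self.arcs = defaultdict(list)  # state -> [(ilabel, olabel, cost, next_state)]

    def add_arc(self, src, ilabel, olabel, cost, dst):
        self.arcs[src].append((ilabel, olabel, cost, dst))


def compose(left, right):
    """Compose left with right. Assumes right has no epsilon input labels."""
    start = (left.start, right.start)
    out = WFST(start, [])
    stack, seen = [start], {start}
    while stack:
        p, q = stack.pop()
        if p in left.finals and q in right.finals:
            out.finals.add((p, q))
        for il, ol, w1, p2 in left.arcs.get(p, []):
            if ol == EPS:
                # Left emits nothing, so only the left machine advances.
                moves = [(EPS, w1, (p2, q))]
            else:
                # Match left output against right input and add the costs.
                moves = [(ol2, w1 + w2, (p2, q2))
                         for il2, ol2, w2, q2 in right.arcs.get(q, []) if il2 == ol]
            for ol_new, w, nxt in moves:
                out.add_arc((p, q), il, ol_new, w, nxt)
                if nxt not in seen:
                    seen.add(nxt)
                    stack.append(nxt)
    return out


def best_path(fst):
    """Dijkstra search: (cost, output labels) of the cheapest accepting path, or None."""
    heap, done, tie = [(0.0, 0, fst.start, ())], set(), 0
    while heap:
        cost, _, state, outs = heapq.heappop(heap)
        if state in done:
            continue
        done.add(state)
        if state in fst.finals:
            return cost, list(outs)
        for _, ol, w, nxt in fst.arcs.get(state, []):
            if nxt not in done:
                tie += 1  # tie-breaker so the heap never compares states
                new_outs = outs if ol == EPS else outs + (ol,)
                heapq.heappush(heap, (cost + w, tie, nxt, new_outs))
    return None


# Step 1: phoneme network (toy costs standing in for -log phoneme posteriors).
P = WFST(0, [2])
for src, ph, c in [(0, "t", 0.25), (0, "d", 2.0), (1, "uw", 0.5), (1, "ow", 0.25)]:
    P.add_arc(src, ph, ph, c, src + 1)

# Step 2: lexicon, phonemes in and words out (word emitted on the first phoneme).
L = WFST(0, [0])
for word, (ph1, ph2) in {"two": ("t", "uw"), "toe": ("t", "ow"), "dough": ("d", "ow")}.items():
    L.add_arc(0, ph1, word, 0.0, word)
    L.add_arc(word, ph2, EPS, 0.0, 0)

# Step 3: syntax knowledge, reduced here to a one-state unigram word model.
G = WFST(0, [0])
for word, c in [("two", 1.0), ("toe", 3.0), ("dough", 0.5)]:
    G.add_arc(0, word, word, c, 0)

cost, words = best_path(compose(compose(P, L), G))
print(cost, words)  # 1.75 ['two']
```

In this toy setup the acoustically cheapest phoneme string is "t ow", but once the lexicon and the word model are composed in, "two" becomes the cheapest overall hypothesis. Each network is built separately and only combined at decoding time, which is the property the abstract attributes to the WFSM framework.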

Date: 2011
Scientific disciplinary sector of the contribution: Sector IINF-05/A - Information Processing Systems
ISBN of the volume: 978-1-61567-692-7
DOI of the contribution: https://dx.doi.org/10.21437/Interspeech.2011-351
Publisher URL (open access where possible): https://www.isca-archive.org/interspeech_2011/siniscalchi11_interspeech.html
Citation: S. M. SINISCALCHI, T. SVENDSEN, and C.-H. LEE (2011). A bottom-up stepwise knowledge-integration approach to large vocabulary continuous speech recognition using weighted finite state machines. In IEEE INTERSPEECH 2011 (pp. 901-904). ISCA-INT SPEECH COMMUNICATION ASSOC, [10.21437/Interspeech.2011-351].
Appears in types: 2.07 Conference proceedings contribution published in a volume

Files in this item:

File	Access	Type	Size	Format
siniscalchi_interspeech2011.pdf	Archive managers only	Publisher's version	308.34 kB	Adobe PDF

Description: The full text of the article is available at the following link: https://www.isca-archive.org/interspeech_2011/siniscalchi11_interspeech.html

Documents in IRIS are protected by copyright, and all rights are reserved unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10447/663735
