Archivio istituzionale della ricerca dell'Università degli Studi di Palermo

We study lattice rescoring with knowledge scores for automatic speech recognition. Frame-based log likelihood ratio is adopted as a score measure of the goodness-of-fit between a speech segment and the knowledge sources. We evaluate our approach in two different applications: phone recognition, and connected digit continuous recognition. By incorporating knowledge scores obtained from 15 attribute detectors for place and manner of articulation, we reduced phone error rate from 40.52% to 35.16% using monophone models. The error rate can be further reduced to 33.42% for triphone models. The same lattice rescoring algorithm is extended to connected digit recognition using the TIDIGITS database, and without using any digit-specific training data. We observed the digit error rate can be effectively reduced to 4.03% from 4.54% which was obtained with the conventional Viterbi decoding algorithm with no knowledge scores.

S. M. SINISCALCHI, J. LI, AND C.-H. LEE (2006). A study on lattice rescoring with knowledge scores for automatic speech recognition. In INTERSPEECH 2006 - ICSLP (pp. 517-520). ISCA - International Speech Communication Association.

A study on lattice rescoring with knowledge scores for automatic speech recognition

S. M. SINISCALCHI^{Primo

Formal Analysis};

2006-01-01

Abstract

We study lattice rescoring with knowledge scores for automatic speech recognition. Frame-based log likelihood ratio is adopted as a score measure of the goodness-of-fit between a speech segment and the knowledge sources. We evaluate our approach in two different applications: phone recognition, and connected digit continuous recognition. By incorporating knowledge scores obtained from 15 attribute detectors for place and manner of articulation, we reduced phone error rate from 40.52% to 35.16% using monophone models. The error rate can be further reduced to 33.42% for triphone models. The same lattice rescoring algorithm is extended to connected digit recognition using the TIDIGITS database, and without using any digit-specific training data. We observed the digit error rate can be effectively reduced to 4.03% from 4.54% which was obtained with the conventional Viterbi decoding algorithm with no knowledge scores.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data
	
				2006
			
	ISBN della monografia 
DATO PREVISTO SU LOGINMIUR
	
				978-1-60423-449-7
			
	Citazione
	
				S. M. SINISCALCHI,  J. LI,  AND C.-H. LEE (2006). A study on lattice rescoring with knowledge scores for automatic speech recognition. In INTERSPEECH 2006 - ICSLP (pp. 517-520). ISCA - International Speech Communication Association.
			
	Appare nelle tipologie:
	
				2.07 Contributo in atti di convegno pubblicato in volume

File in questo prodotto:

File	Dimensione	Formato
INTERSPEECH_2006.pdf Solo gestori archvio Tipologia: Versione Editoriale Dimensione 126.47 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	126.47 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/624144

Citazioni

ND

23

18

social impact