Li J., Siniscalchi S., Lee C. (2007). Approximate test risk minimization through soft margin estimation. In Proc. 2007 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '07). doi:10.1109/ICASSP.2007.366997.
Approximate test risk minimization through soft margin estimation
Li J.; Siniscalchi S.; Lee C.
2007-01-01
Abstract
In a recent study, we proposed soft margin estimation (SME) to learn parameters of continuous density hidden Markov models (HMMs). Our earlier experiments with connected digit recognition have shown that SME offers great advantages over other state-of-the-art discriminative training methods. In this paper, we illustrate SME from the perspective of statistical learning theory and show that, by including a margin in formulating the SME objective function, it is capable of directly minimizing the approximate test risk, while most other training methods intend to minimize only the empirical risk. We test SME on the 5k-word Wall Street Journal task, and find the proposed approach achieves a relative word error rate reduction of about 10% over our best baseline results in different experimental configurations. We believe this is the first attempt to show the effectiveness of margin-based acoustic modeling for large vocabulary continuous speech recognition. We also expect further performance improvements in the future, because the approximate test risk minimization principle offers a flexible yet rigorous framework that facilitates easy incorporation of new margin-based optimization criteria into HMM training.
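The margin-based objective the abstract describes can be sketched schematically. The notation below is illustrative, following the general form of soft margin estimation reported in the SME literature rather than this paper's exact symbols: a margin $\rho$ is optimized jointly with the HMM parameters $\Lambda$, trading margin size against a hinge-style empirical loss over training utterances $O_i$:

```latex
\min_{\Lambda,\;\rho > 0} \quad
  \frac{\lambda}{\rho}
  \;+\;
  \frac{1}{N} \sum_{i=1}^{N}
  \bigl(\rho - d(O_i,\Lambda)\bigr)\,
  \mathbb{1}\bigl[d(O_i,\Lambda) \le \rho\bigr]
```

Here $d(O_i,\Lambda)$ denotes a separation (discriminant) measure between the correct and competing hypotheses for utterance $O_i$, $\lambda$ balances the two terms, and the indicator restricts the loss to utterances that fall inside the margin. Favoring a large $\rho$ while penalizing in-margin samples is what connects the objective to a bound on the approximate test risk, rather than to the empirical risk alone.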
File | Size | Format | Access
---|---|---|---
Approximate_Test_Risk_Minimization_Through_Soft_Margin_Estimation.pdf (publisher's version) | 4.98 MB | Adobe PDF | Restricted to archive managers; a copy may be requested
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.