Archivio istituzionale della ricerca dell'Università degli Studi di Palermo

Recently, we have proposed a detection-based speech recognizer which has two main components: a bank of phonetic feature detectors implemented with hidden Markov models (HMMs), and an event merger. Each detector generates a score that pertains to some phonetic features, e.g. voicing. The merger combines all these scores to generate phone labels. The parameters of the detectors and the merger can be optimized either separately or jointly, and we showed that penalized logistic regression machine (PLRM) is a convenient tool for joint optimization. We validated our approach on a rescoring scheme. In this work, we tackle the phone classification problem and show that high level phone accuracy can be achieved without a direct modeling of the phones when PLRM is used. We also show that better results can be obtained by increasing the number of phonetic features, and that our method outperforms phone classifiers trained either by maximum likelihood estimation, or maximum mutual information

S. M. SINISCALCHI, SVENDSEN T, LEE C.-H (2008). A penalized logistic regression approach to detection based phone classification. In Interspeech 2008 (pp. 2390-2393) [10.21437/Interspeech.2008-126].

A penalized logistic regression approach to detection based phone classification

S. M. SINISCALCHI^{Primo

Investigation};SVENDSEN T;LEE C.-H

2008-01-01

Abstract

Recently, we have proposed a detection-based speech recognizer which has two main components: a bank of phonetic feature detectors implemented with hidden Markov models (HMMs), and an event merger. Each detector generates a score that pertains to some phonetic features, e.g. voicing. The merger combines all these scores to generate phone labels. The parameters of the detectors and the merger can be optimized either separately or jointly, and we showed that penalized logistic regression machine (PLRM) is a convenient tool for joint optimization. We validated our approach on a rescoring scheme. In this work, we tackle the phone classification problem and show that high level phone accuracy can be achieved without a direct modeling of the phones when PLRM is used. We also show that better results can be obtained by increasing the number of phonetic features, and that our method outperforms phone classifiers trained either by maximum likelihood estimation, or maximum mutual information

Scheda breve

Scheda completa

Scheda completa (DC)

	Data
	
				2008
			
	ISBN della monografia 
DATO PREVISTO SU LOGINMIUR
	
				978-1-61567-378-0
			
	DOI del contributo 
DATO PREVISTO SU LOGINMIUR
	
				https://dx.doi.org/10.21437/Interspeech.2008-126
			
	URL dell'editore (Open access ove possibile)
	
				https://www.isca-archive.org/interspeech_2008/siniscalchi08_interspeech.html
			
	Citazione
	
				S. M. SINISCALCHI,  SVENDSEN T,  LEE C.-H (2008). A penalized logistic regression approach to detection based phone classification. In Interspeech 2008 (pp. 2390-2393) [10.21437/Interspeech.2008-126].
			
	Appare nelle tipologie:
	
				2.07 Contributo in atti di convegno pubblicato in volume

File in questo prodotto:

File	Dimensione	Formato
INTERSPEECH_2008_Siniscalchi.pdf Solo gestori archvio Descrizione: Il testo pieno dell’articolo è disponibile al seguente link: https://www.isca-archive.org/interspeech_2008/siniscalchi08_interspeech.html Tipologia: Versione Editoriale Dimensione 241.15 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	241.15 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/649518

Citazioni

ND

6

5

social impact