Archivio istituzionale della ricerca dell'Università degli Studi di Palermo

In this paper we present an experimental study exploiting structural Bayesian adaptation for handling potential mismatches between training and test conditions for real-world applications to be realized in our multilingual very large vocabulary speech recognition (VLVSR) system project sponsored by MOTIE (The Ministry of Trade, Industry and Energy), Republic of Korea. The goal of the project is to construct a national-wide VLVSR cloud service platform for mobile applications. Besides system architecture design issues, at such a large scale, performance robustness problems, caused by mismatches in speakers, tasks, environments, and domains, etc., need to be taken into account very carefully as well. We decide to adopt adaptation, especially the structural MAP, techniques to reduce system accuracy degradation caused by these mismatches. Being part of an ongoing project, we describe how structural MAP approaches can be used for adaptation of both acoustic and language models for our VLVSR systems, and provide convincing experimental results to demonstrate how adaptation can be utilized to bridge the performance gap between the current state-of-the-art and deployable VLVSR systems.

I. Fan Chen, SINISCALCHI, S.M., Seokyong Moon, Daejin Shin, Myong Wan Koo, Minhwa Chung, et al. (2013). An experimental study on structural-MAP approaches to implementing very large vocabulary speech recognition systems for real-world tasks. In 2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (pp. 1-10) [10.1109/APSIPA.2013.6694185].

An experimental study on structural-MAP approaches to implementing very large vocabulary speech recognition systems for real-world tasks

I. Fan Chen;SINISCALCHI, SABATO MARCO;Seokyong Moon;Daejin Shin;Myong Wan Koo;Minhwa Chung;Chin Hui Lee

2013-01-01

Abstract

In this paper we present an experimental study exploiting structural Bayesian adaptation for handling potential mismatches between training and test conditions for real-world applications to be realized in our multilingual very large vocabulary speech recognition (VLVSR) system project sponsored by MOTIE (The Ministry of Trade, Industry and Energy), Republic of Korea. The goal of the project is to construct a national-wide VLVSR cloud service platform for mobile applications. Besides system architecture design issues, at such a large scale, performance robustness problems, caused by mismatches in speakers, tasks, environments, and domains, etc., need to be taken into account very carefully as well. We decide to adopt adaptation, especially the structural MAP, techniques to reduce system accuracy degradation caused by these mismatches. Being part of an ongoing project, we describe how structural MAP approaches can be used for adaptation of both acoustic and language models for our VLVSR systems, and provide convincing experimental results to demonstrate how adaptation can be utilized to bridge the performance gap between the current state-of-the-art and deployable VLVSR systems.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data
	
				2013
			
	ISBN della monografia 
DATO PREVISTO SU LOGINMIUR
	
				9789869000604
			
	DOI del contributo 
DATO PREVISTO SU LOGINMIUR
	
				https://dx.doi.org/10.1109/APSIPA.2013.6694185
			
	URL alternativo rispetto a quello dell'editore 
DATO PREVISTO SU LOGINMIUR
	
				http://ieeexplore.ieee.org/document/6694185/
			
	Citazione
	
				I. Fan Chen, SINISCALCHI, S.M.,  Seokyong Moon,  Daejin Shin,  Myong Wan Koo,  Minhwa Chung, et al. (2013). An experimental study on structural-MAP approaches to implementing very large vocabulary speech recognition systems for real-world tasks. In 2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (pp. 1-10) [10.1109/APSIPA.2013.6694185].
			
	Appare nelle tipologie:
	
				2.07 Contributo in atti di convegno pubblicato in volume

File in questo prodotto:

File	Dimensione	Formato
06694185.pdf Solo gestori archvio Tipologia: Versione Editoriale Dimensione 384.11 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	384.11 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/649515

Citazioni

ND

0

0

social impact