Archivio istituzionale della ricerca dell'Università degli Studi di Palermo

Automatic Speech Attribute Transcription (ASAT), an ITR project sponsored under the NSF grant (IIS-04-27113), and Spoken Information Retrieval by Knowledge Utilization in Statistical Speech Processing (SIRKUS), a project funded by the VERDIKT programme at the Research Council of Norway, are two research projects carried out at Georgia Institute of Technology and at Norwegian University of Science and Technology, respectively, with the purpose of investigating and developing new paradigms for speech recognition that have the capability of bridging the gap between machine and human performance. These projects approach speech recognition from a more linguistic perspective: unlike traditional ASR systems, humans detect acoustic and auditory cues, weigh and combine them to form theories, and then process these cognitive hypotheses until linguistically and pragmatically consistent speech understanding is achieved. A major goal of the ASAT/SIRKUS paradigms is to develop a detection-based approach to automatic speech recognition (ASR) based on attribute detection and knowledge integration. We report on progress of these two projects on two different tasks, namely the cross-language and language universal attribute/phone recognition task, and the language identification (LID) task.

SINISCALCHI, S.M., Torbjorn Svendsen, Chin Hui Lee (2010). A survey on recent progress in the ASAT/SIRKUS paradigm. In ISCSLP (pp. 465-470) [10.1109/ISCSLP.2010.5684480].

A survey on recent progress in the ASAT/SIRKUS paradigm

SINISCALCHI, SABATO MARCO^{Primo

Investigation};Torbjorn Svendsen^Secondo;Chin Hui Lee^Ultimo

2010-01-01

Abstract

Automatic Speech Attribute Transcription (ASAT), an ITR project sponsored under the NSF grant (IIS-04-27113), and Spoken Information Retrieval by Knowledge Utilization in Statistical Speech Processing (SIRKUS), a project funded by the VERDIKT programme at the Research Council of Norway, are two research projects carried out at Georgia Institute of Technology and at Norwegian University of Science and Technology, respectively, with the purpose of investigating and developing new paradigms for speech recognition that have the capability of bridging the gap between machine and human performance. These projects approach speech recognition from a more linguistic perspective: unlike traditional ASR systems, humans detect acoustic and auditory cues, weigh and combine them to form theories, and then process these cognitive hypotheses until linguistically and pragmatically consistent speech understanding is achieved. A major goal of the ASAT/SIRKUS paradigms is to develop a detection-based approach to automatic speech recognition (ASR) based on attribute detection and knowledge integration. We report on progress of these two projects on two different tasks, namely the cross-language and language universal attribute/phone recognition task, and the language identification (LID) task.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data
	
				2010
			
	Settore scientifico disciplinare del contributo
	
				Settore IINF-05/A - Sistemi di elaborazione delle informazioni
			
	ISBN della monografia 
DATO PREVISTO SU LOGINMIUR
	
				9781424462445
			
	DOI del contributo 
DATO PREVISTO SU LOGINMIUR
	
				https://dx.doi.org/10.1109/ISCSLP.2010.5684480
			
	URL alternativo rispetto a quello dell'editore 
DATO PREVISTO SU LOGINMIUR
	
				http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5684480
			
	Citazione
	
				SINISCALCHI, S.M.,  Torbjorn Svendsen,  Chin Hui Lee (2010). A survey on recent progress in the ASAT/SIRKUS paradigm. In ISCSLP (pp. 465-470) [10.1109/ISCSLP.2010.5684480].
			
	Appare nelle tipologie:
	
				2.07 Contributo in atti di convegno pubblicato in volume

File in questo prodotto:

File	Dimensione	Formato
ISCSLP_2010.pdf Solo gestori archvio Tipologia: Versione Editoriale Dimensione 188.34 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	188.34 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/664125

Citazioni

ND

0

ND

social impact