Archivio istituzionale della ricerca dell'Università degli Studi di Palermo

We propose a variational Bayesian (VB) approach to learning distributions of latent variables in deep neural network (DNN) models for cross-domain knowledge transfer, to address acoustic mismatches between training and testing conditions. Instead of carrying out point estimation in conventional maximum a posteriori estimation with a risk of having a curse of dimensionality in estimating a huge number of model parameters, we focus our attention on estimating a manageable number of latent variables of DNNs via a VB inference framework. To accomplish model transfer, knowledge learnt from a source domain is encoded in prior distributions of latent variables and optimally combined, in a Bayesian sense, with a small set of adaptation data from a target domain to approximate the corresponding posterior distributions. Experimental results on device adaptation in acoustic scene classification show that our proposed VB approach can obtain good improvements on target devices, and consistently outperforms 13 state-of-the-art knowledge transfer algorithms.

Hu, H.u., Siniscalchi, S.M., Yang, C.H., Lee, C. (2022). A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer. In 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 4041-4045) [10.1109/ICASSP43922.2022.9746076].

A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer

Siniscalchi, Sabato Marco^Supervision;Yang, Chao-Han Huck;Lee, Chin-Hui

2022-01-01

Abstract

We propose a variational Bayesian (VB) approach to learning distributions of latent variables in deep neural network (DNN) models for cross-domain knowledge transfer, to address acoustic mismatches between training and testing conditions. Instead of carrying out point estimation in conventional maximum a posteriori estimation with a risk of having a curse of dimensionality in estimating a huge number of model parameters, we focus our attention on estimating a manageable number of latent variables of DNNs via a VB inference framework. To accomplish model transfer, knowledge learnt from a source domain is encoded in prior distributions of latent variables and optimally combined, in a Bayesian sense, with a small set of adaptation data from a target domain to approximate the corresponding posterior distributions. Experimental results on device adaptation in acoustic scene classification show that our proposed VB approach can obtain good improvements on target devices, and consistently outperforms 13 state-of-the-art knowledge transfer algorithms.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data
	
				2022
			
	ISBN della monografia 
DATO PREVISTO SU LOGINMIUR
	
				978-1-6654-0540-9
			
	DOI del contributo 
DATO PREVISTO SU LOGINMIUR
	
				https://dx.doi.org/10.1109/ICASSP43922.2022.9746076
			
	URL alternativo rispetto a quello dell'editore 
DATO PREVISTO SU LOGINMIUR
	
				https://ieeexplore.ieee.org/abstract/document/9746076
			
	Citazione
	
				Hu, H.u., Siniscalchi, S.M., Yang, C.H., Lee, C. (2022). A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer. In 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 4041-4045) [10.1109/ICASSP43922.2022.9746076].
			
	Appare nelle tipologie:
	
				2.07 Contributo in atti di convegno pubblicato in volume

File in questo prodotto:

File	Dimensione	Formato
A_Variational_Bayesian_Approach_to_Learning_Latent_Variables_for_Acoustic_Knowledge_Transfer-2.pdf Solo gestori archvio Tipologia: Versione Editoriale Dimensione 1.29 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.29 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/636663

Citazioni

ND

4

3

social impact