Speech signal contains rich information encompassing gender, accent, speaking environment, and other speaker characteristics. Meanwhile, deploying high-performance speech applications often requires a large amount of training speech data, which are often collected from end-users. Therefore, protecting data privacy becomes a rising concern when speech data are employed to deploy commercial speech applications. That motivates the rising interest in designing “federated learning” for voice assistants and mobile applications. This chapter will introduce recent advances in federated learning foundation algorithms and applications for speech recognition, and general acoustic processing. Furthermore, it will introduce how federated learning-based speech processing techniques (e.g., average gradient and teacher–student learning) would connect to some critical data protection guidelines and public regulations, such as European Union's General Data Protection Regulation (GDPR) and California Consumer Privacy Act (CCPA).
Yang C.-H.H., Siniscalchi S.M. (2024). Federated learning for privacy-preserving speech recognition. In Federated Learning: Theory and Practice (pp. 353-368). Elsevier [10.1016/B978-0-44-319037-7.00030-2].
Federated learning for privacy-preserving speech recognition
Siniscalchi S. M.Writing – Original Draft Preparation
2024-02-24
Abstract
Speech signal contains rich information encompassing gender, accent, speaking environment, and other speaker characteristics. Meanwhile, deploying high-performance speech applications often requires a large amount of training speech data, which are often collected from end-users. Therefore, protecting data privacy becomes a rising concern when speech data are employed to deploy commercial speech applications. That motivates the rising interest in designing “federated learning” for voice assistants and mobile applications. This chapter will introduce recent advances in federated learning foundation algorithms and applications for speech recognition, and general acoustic processing. Furthermore, it will introduce how federated learning-based speech processing techniques (e.g., average gradient and teacher–student learning) would connect to some critical data protection guidelines and public regulations, such as European Union's General Data Protection Regulation (GDPR) and California Consumer Privacy Act (CCPA).File | Dimensione | Formato | |
---|---|---|---|
elsevierbook_template_speech_w_marco_2022_v2.pdf
Solo gestori archvio
Descrizione: pre-print
Tipologia:
Pre-print
Dimensione
828.06 kB
Formato
Adobe PDF
|
828.06 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.