Speech signal contains rich information encompassing gender, accent, speaking environment, and other speaker characteristics. Meanwhile, deploying high-performance speech applications often requires a large amount of training speech data, which are often collected from end-users. Therefore, protecting data privacy becomes a rising concern when speech data are employed to deploy commercial speech applications. That motivates the rising interest in designing “federated learning” for voice assistants and mobile applications. This chapter will introduce recent advances in federated learning foundation algorithms and applications for speech recognition, and general acoustic processing. Furthermore, it will introduce how federated learning-based speech processing techniques (e.g., average gradient and teacher–student learning) would connect to some critical data protection guidelines and public regulations, such as European Union's General Data Protection Regulation (GDPR) and California Consumer Privacy Act (CCPA).

Yang C.-H.H., Siniscalchi S.M. (2024). Federated learning for privacy-preserving speech recognition. In Federated Learning: Theory and Practice (pp. 353-368). Elsevier [10.1016/B978-0-44-319037-7.00030-2].

Federated learning for privacy-preserving speech recognition

Siniscalchi S. M.
Writing – Original Draft Preparation
2024-02-24

Abstract

Speech signal contains rich information encompassing gender, accent, speaking environment, and other speaker characteristics. Meanwhile, deploying high-performance speech applications often requires a large amount of training speech data, which are often collected from end-users. Therefore, protecting data privacy becomes a rising concern when speech data are employed to deploy commercial speech applications. That motivates the rising interest in designing “federated learning” for voice assistants and mobile applications. This chapter will introduce recent advances in federated learning foundation algorithms and applications for speech recognition, and general acoustic processing. Furthermore, it will introduce how federated learning-based speech processing techniques (e.g., average gradient and teacher–student learning) would connect to some critical data protection guidelines and public regulations, such as European Union's General Data Protection Regulation (GDPR) and California Consumer Privacy Act (CCPA).
24-feb-2024
Yang C.-H.H., Siniscalchi S.M. (2024). Federated learning for privacy-preserving speech recognition. In Federated Learning: Theory and Practice (pp. 353-368). Elsevier [10.1016/B978-0-44-319037-7.00030-2].
File in questo prodotto:
File Dimensione Formato  
elsevierbook_template_speech_w_marco_2022_v2.pdf

Solo gestori archvio

Descrizione: pre-print
Tipologia: Pre-print
Dimensione 828.06 kB
Formato Adobe PDF
828.06 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/637516
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact