Lipreading which infers spoken content based solely on visual information such as lip movements is crucial in multi-modal research medicine and human-computer interaction. We organized the Chat-scenario Chinese Lipreading (Chat-CLR) challenge focusing on unscripted chat scenarios among native Chinese speakers. We placed emphasis on two tasks wake word lipreading (WWLR) and target speaker lipreading (TSLR). We are dedicated to fulfilling the requirements of waking up smart home devices within household settings and utilizing video for speech recognition with these smart home devices. For the WWLR task we received submissions from 5 teams with the top-performing system showing a 71.4% improvement over the baseline system. In the TSLR task we received submissions from 6 teams and the best system achieved a 22.1% improvement compared to the baseline system.

Zhang C.Y., Chen H., Du J., Siniscalchi S.M., Jiang Y., Lee C.H. (2024). Summary on the Chat-Scenario Chinese Lipreading (ChatCLR) Challenge. In 2024 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) (pp. 1-6) [10.1109/ICMEW63481.2024.10645486].

Summary on the Chat-Scenario Chinese Lipreading (ChatCLR) Challenge

Siniscalchi S. M.;
2024-01-01

Abstract

Lipreading which infers spoken content based solely on visual information such as lip movements is crucial in multi-modal research medicine and human-computer interaction. We organized the Chat-scenario Chinese Lipreading (Chat-CLR) challenge focusing on unscripted chat scenarios among native Chinese speakers. We placed emphasis on two tasks wake word lipreading (WWLR) and target speaker lipreading (TSLR). We are dedicated to fulfilling the requirements of waking up smart home devices within household settings and utilizing video for speech recognition with these smart home devices. For the WWLR task we received submissions from 5 teams with the top-performing system showing a 71.4% improvement over the baseline system. In the TSLR task we received submissions from 6 teams and the best system achieved a 22.1% improvement compared to the baseline system.
2024
Settore IINF-05/A - Sistemi di elaborazione delle informazioni
979-8-3503-7981-5
Zhang C.Y., Chen H., Du J., Siniscalchi S.M., Jiang Y., Lee C.H. (2024). Summary on the Chat-Scenario Chinese Lipreading (ChatCLR) Challenge. In 2024 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) (pp. 1-6) [10.1109/ICMEW63481.2024.10645486].
File in questo prodotto:
File Dimensione Formato  
Summary_on_the_Chat-Scenario_Chinese_Lipreading_ChatCLR_Challenge.pdf

Solo gestori archvio

Tipologia: Versione Editoriale
Dimensione 432 kB
Formato Adobe PDF
432 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/663742
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact