In this paper, a multimodal system for recognizing people in intelligent environments is presented. Users are identified and tracked by detecting and recognizing voices and faces through cameras and microphones spread around the environment. This multimodal approach has been chosen to develop a flexible and cheap though reliable system, implemented through consumer electronics. Voice features are extracted through a short time spectrum analysis, while face features are extracted using the eigenfaces technique. The recognition task is achieved through the use of some Support Vector Machines, one per modality, that learn and classify the features of each person, while bindings between modalities are also learnt through a cross-anchoring learning rule based on the mutual exclusivity selection principle. The system has been developed using NMM, a middleware software capable of splitting the sensors processing in several software nodes, making the system scalable in the number of cameras and microphones

Anzalone, S.M., Menegatti, E., Pagello, E., Sorbello, R., Yoshikawa, Y., Ishiguro, H. (2011). A Multimodal People Recognition System for an Intelligent Environment. In R. Pirrone, F. Sorbello (a cura di), AI*IA 2011: Artificial Intelligence Around Man and Beyond (pp. 451-456). Springer Berlin Heidelberg [10.1007/978-3-642-23954-0_46].

A Multimodal People Recognition System for an Intelligent Environment

SORBELLO, Rosario;
2011-01-01

Abstract

In this paper, a multimodal system for recognizing people in intelligent environments is presented. Users are identified and tracked by detecting and recognizing voices and faces through cameras and microphones spread around the environment. This multimodal approach has been chosen to develop a flexible and cheap though reliable system, implemented through consumer electronics. Voice features are extracted through a short time spectrum analysis, while face features are extracted using the eigenfaces technique. The recognition task is achieved through the use of some Support Vector Machines, one per modality, that learn and classify the features of each person, while bindings between modalities are also learnt through a cross-anchoring learning rule based on the mutual exclusivity selection principle. The system has been developed using NMM, a middleware software capable of splitting the sensors processing in several software nodes, making the system scalable in the number of cameras and microphones
2011
Settore ING-INF/05 - Sistemi Di Elaborazione Delle Informazioni
Anzalone, S.M., Menegatti, E., Pagello, E., Sorbello, R., Yoshikawa, Y., Ishiguro, H. (2011). A Multimodal People Recognition System for an Intelligent Environment. In R. Pirrone, F. Sorbello (a cura di), AI*IA 2011: Artificial Intelligence Around Man and Beyond (pp. 451-456). Springer Berlin Heidelberg [10.1007/978-3-642-23954-0_46].
File in questo prodotto:
File Dimensione Formato  
MultimodalPeopleRecognitionSystem.pdf

Solo gestori archvio

Dimensione 322.74 kB
Formato Adobe PDF
322.74 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/77378
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 1
social impact