In this paper, a multimodal system for recognizing people in intelligent environments is presented. Users are identified and tracked by detecting and recognizing voices and faces through cameras and microphones spread around the environment. This multimodal approach has been chosen to develop a flexible and cheap though reliable system, implemented through consumer electronics. Voice features are extracted through a short time spectrum analysis, while face features are extracted using the eigenfaces technique. The recognition task is achieved through the use of some Support Vector Machines, one per modality, that learn and classify the features of each person, while bindings between modalities are also learnt through a cross-anchoring learning rule based on the mutual exclusivity selection principle. The system has been developed using NMM, a middleware software capable of splitting the sensors processing in several software nodes, making the system scalable in the number of cameras and microphones
Anzalone, S., Menegatti, E., Pagello, E., Sorbello, R., Yoshikawa, Y., & Ishiguro, H. (2011). A Multimodal People Recognition System for an Intelligent Environment. In R. Pirrone, & F. Sorbello (a cura di), AI*IA 2011: Artificial Intelligence Around Man and Beyond (pp. 451-456). Springer Berlin Heidelberg.
Data di pubblicazione: | 2011 |
Titolo: | A Multimodal People Recognition System for an Intelligent Environment |
Autori: | |
Citazione: | Anzalone, S., Menegatti, E., Pagello, E., Sorbello, R., Yoshikawa, Y., & Ishiguro, H. (2011). A Multimodal People Recognition System for an Intelligent Environment. In R. Pirrone, & F. Sorbello (a cura di), AI*IA 2011: Artificial Intelligence Around Man and Beyond (pp. 451-456). Springer Berlin Heidelberg. |
Abstract: | In this paper, a multimodal system for recognizing people in intelligent environments is presented. Users are identified and tracked by detecting and recognizing voices and faces through cameras and microphones spread around the environment. This multimodal approach has been chosen to develop a flexible and cheap though reliable system, implemented through consumer electronics. Voice features are extracted through a short time spectrum analysis, while face features are extracted using the eigenfaces technique. The recognition task is achieved through the use of some Support Vector Machines, one per modality, that learn and classify the features of each person, while bindings between modalities are also learnt through a cross-anchoring learning rule based on the mutual exclusivity selection principle. The system has been developed using NMM, a middleware software capable of splitting the sensors processing in several software nodes, making the system scalable in the number of cameras and microphones |
Digital Object Identifier (DOI): | http://dx.doi.org/10.1007/978-3-642-23954-0_46 |
URL: | http://link.springer.com/content/pdf/10.1007%2F978-3-642-23954-0_46.pdf |
Settore Scientifico Disciplinare: | Settore ING-INF/05 - Sistemi Di Elaborazione Delle Informazioni |
Appare nelle tipologie: | 2.01 Capitolo o Saggio |
File in questo prodotto:
File | Descrizione | Tipologia | Licenza | |
---|---|---|---|---|
MultimodalPeopleRecognitionSystem.pdf | N/A | Administrator Richiedi una copia |