In this paper, we present a method for recognizing human activities using information sensed by an RGB-D camera, namely the Microsoft Kinect. Our approach is based on the estimation of some relevant joints of the human body by means of the Kinect; three different machine learning techniques, i.e., K-means clustering, support vector machines, and hidden Markov models, are combined to detect the postures involved while performing an activity, to classify them, and to model each activity as a spatiotemporal evolution of known postures. Experiments were performed on Kinect Activity Recognition Dataset, a new dataset, and on CAD-60, a public dataset. Experimental results show that our solution outperforms four relevant works based on RGB-D image fusion, hierarchical Maximum Entropy Markov Model, Markov Random Fields, and Eigenjoints, respectively. The performance we achieved, i.e., precision/recall of 77.3% and 76.7%, and the ability to recognize the activities in real time show promise for applied use.

Gaglio, S., Lo Re, G., Morana, M. (2015). Human Activity Recognition Process Using 3-D Posture Data. IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 45(5), 586-597 [10.1109/THMS.2014.2377111].

Human Activity Recognition Process Using 3-D Posture Data

GAGLIO, Salvatore;LO RE, Giuseppe;MORANA, Marco
2015-01-01

Abstract

In this paper, we present a method for recognizing human activities using information sensed by an RGB-D camera, namely the Microsoft Kinect. Our approach is based on the estimation of some relevant joints of the human body by means of the Kinect; three different machine learning techniques, i.e., K-means clustering, support vector machines, and hidden Markov models, are combined to detect the postures involved while performing an activity, to classify them, and to model each activity as a spatiotemporal evolution of known postures. Experiments were performed on Kinect Activity Recognition Dataset, a new dataset, and on CAD-60, a public dataset. Experimental results show that our solution outperforms four relevant works based on RGB-D image fusion, hierarchical Maximum Entropy Markov Model, Markov Random Fields, and Eigenjoints, respectively. The performance we achieved, i.e., precision/recall of 77.3% and 76.7%, and the ability to recognize the activities in real time show promise for applied use.
2015
Settore ING-INF/05 - Sistemi Di Elaborazione Delle Informazioni
Gaglio, S., Lo Re, G., Morana, M. (2015). Human Activity Recognition Process Using 3-D Posture Data. IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 45(5), 586-597 [10.1109/THMS.2014.2377111].
File in questo prodotto:
File Dimensione Formato  
06990523.pdf

Solo gestori archvio

Descrizione: articolo principale
Dimensione 613.52 kB
Formato Adobe PDF
613.52 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/103781
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 284
  • ???jsp.display-item.citation.isi??? 232
social impact