Nowadays, interactive multimedia systems are part of everyday life. The most common way to interact and control these devices is through remote controls or some sort of touch panel. In recent years, due to the introduction of reliable low-cost Kinect-like sensing technology, more and more attention has been dedicated to touchless interfaces. A Kinect-like devices can be positioned on top of a multimedia system, detect a person in front of the system and process skeletal data, optionally with RGBd data, to determine user gestures. The gestures of the person can then be used to control, for example, a media device. Even though there is a lot of interest in this area, currently, no consumer system is using this type of interaction probably due to the inherent difficulties in processing raw data coming from Kinect cameras to detect the user intentions. In this work, we considered the use of neural networks using as input only the Kinect skeletal data for the task of user intention classification. We compared different deep networks and analyzed their outputs.

marco la cascia, i.i. (2019). Recognition of Human Actions Through Deep Neural Networks for Multimedia Systems Interaction. In MMEDIA 2019 : The Eleventh International Conference on Advances in Multimedia (pp. 71-76). IARIA.

Recognition of Human Actions Through Deep Neural Networks for Multimedia Systems Interaction

marco la cascia
;
ignazio infantino;filippo vella
2019-01-01

Abstract

Nowadays, interactive multimedia systems are part of everyday life. The most common way to interact and control these devices is through remote controls or some sort of touch panel. In recent years, due to the introduction of reliable low-cost Kinect-like sensing technology, more and more attention has been dedicated to touchless interfaces. A Kinect-like devices can be positioned on top of a multimedia system, detect a person in front of the system and process skeletal data, optionally with RGBd data, to determine user gestures. The gestures of the person can then be used to control, for example, a media device. Even though there is a lot of interest in this area, currently, no consumer system is using this type of interaction probably due to the inherent difficulties in processing raw data coming from Kinect cameras to detect the user intentions. In this work, we considered the use of neural networks using as input only the Kinect skeletal data for the task of user intention classification. We compared different deep networks and analyzed their outputs.
2019
978-1-61208-697-2
marco la cascia, i.i. (2019). Recognition of Human Actions Through Deep Neural Networks for Multimedia Systems Interaction. In MMEDIA 2019 : The Eleventh International Conference on Advances in Multimedia (pp. 71-76). IARIA.
File in questo prodotto:
File Dimensione Formato  
mmedia_2019_5_20_50048.pdf

accesso aperto

Descrizione: Main paper
Tipologia: Versione Editoriale
Dimensione 600.48 kB
Formato Adobe PDF
600.48 kB Adobe PDF Visualizza/Apri
mmedia_2019.pdf

accesso aperto

Descrizione: Cover e index
Dimensione 336.8 kB
Formato Adobe PDF
336.8 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/349579
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact