RLSTM: A Novel Residual and Recurrent Network for Pedestrian Action Classification

Gazzeh, S; Lo Presti, L; Douik, A; La Cascia, M

doi:10.1007/978-3-031-44240-7_6

Properly training LSTMs requires long time and extensive amount of data. To improve the training of these models, this paper proposes a novel residual and recurrent neural network, Resnet-LSTM, for spatio-temporal pedestrian action recognition from image sequences. The model includes a novel layer, called MapGrad, whose goal is improving stationarity of the feature map sequences processed by the ConvLSTM. The paper demonstrates the effectiveness of the proposed model and the MapGrad layer in the spatio-temporal classification of pedestrian actions through an ablation study and comparison with state-of-the-art methods. Overall, RLSTM achieves an accuracy value of 88% and an average precision of 94% on the JAAD dataset, which is a widely used benchmark in the field. Finally, the paper empirically analyzes the effect of increasing input sequence length on standing action recognition, showing that the proposed method yields a recall of 93%.

Gazzeh, S., Lo Presti, L., Douik, A., La Cascia, M. (2023). RLSTM: A Novel Residual and Recurrent Network for Pedestrian Action Classification. In Computer Analysis of Images and Patterns, Proceedings, Part II, CAIP 2023 (pp. 55-64) [10.1007/978-3-031-44240-7_6].

RLSTM: A Novel Residual and Recurrent Network for Pedestrian Action Classification

Gazzeh, Soulayma;Lo Presti, Liliana;Douik, Ali;La Cascia, Marco

2023-09-01

Abstract

Properly training LSTMs requires long time and extensive amount of data. To improve the training of these models, this paper proposes a novel residual and recurrent neural network, Resnet-LSTM, for spatio-temporal pedestrian action recognition from image sequences. The model includes a novel layer, called MapGrad, whose goal is improving stationarity of the feature map sequences processed by the ConvLSTM. The paper demonstrates the effectiveness of the proposed model and the MapGrad layer in the spatio-temporal classification of pedestrian actions through an ablation study and comparison with state-of-the-art methods. Overall, RLSTM achieves an accuracy value of 88% and an average precision of 94% on the JAAD dataset, which is a widely used benchmark in the field. Finally, the paper empirically analyzes the effect of increasing input sequence length on standing action recognition, showing that the proposed method yields a recall of 93%.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data
	
				set-2023
			
	ISBN della monografia 
DATO PREVISTO SU LOGINMIUR
	
				978-3-031-44239-1
978-3-031-44240-7
			
	DOI del contributo 
DATO PREVISTO SU LOGINMIUR
	
				https://dx.doi.org/10.1007/978-3-031-44240-7_6
			
	Citazione
	
				Gazzeh, S., Lo Presti, L., Douik, A., La Cascia, M. (2023). RLSTM: A Novel Residual and Recurrent Network for Pedestrian Action Classification. In Computer Analysis of Images and Patterns, Proceedings, Part II, CAIP 2023 (pp. 55-64) [10.1007/978-3-031-44240-7_6].
			
	Appare nelle tipologie:
	
				2.07 Contributo in atti di convegno pubblicato in volume

File in questo prodotto:

File	Dimensione	Formato
CAIP_2023.pdf Solo gestori archvio Descrizione: Articolo Tipologia: Versione Editoriale Dimensione 1.02 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.02 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/610419

Citazioni

ND

0

0

Archivio istituzionale della ricerca dell'Università degli Studi di Palermo

RLSTM: A Novel Residual and Recurrent Network for Pedestrian Action Classification

Gazzeh, Soulayma;Lo Presti, Liliana;Douik, Ali;La Cascia, Marco

2023-09-01

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Citazioni

social impact

Archivio istituzionale della ricerca dell'Università degli Studi di Palermo

RLSTM: A Novel Residual and Recurrent Network for Pedestrian Action Classification

Gazzeh, Soulayma;Lo Presti, Liliana;Douik, Ali;La Cascia, Marco

2023-09-01

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Informazioni

Citazioni

social impact

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)