Archivio istituzionale della ricerca dell'Università degli Studi di Palermo

In this paper we describe a deep learning model based on a Data Augmentation (DA) layer followed by a Convolutional Neural Network (CNN). The proposed model was developed by our team for the Profiling Irony and Stereotype Spreaders (ISSs) task proposed by the PAN 2022 organizers. As a first step, to classify an author as ISS or not (nISS), we developed a DA layer that expands each sample in the dataset provided. Using this augmented dataset we trained the CNN. Then, to submit our predictions, we apply our DA layer on the samples within the unlabeled test set too. Finally we fed our trained CNN with the augmented test set to generate our final predictions. To develop and test our model we used a 5-fold cross validation on the labelled training set. The proposed model reaches a maximum accuracy of 0.92 and an average accuracy of 0.89 over the five folds. Meanwhile, on the provided test set the proposed model reaches an accuracy of 0.9278.

Mangione S., Siino M., Garbo G. (2022). Improving Irony and Stereotype Spreaders Detection using Data Augmentation and Convolutional Neural Network. In CEUR Workshop Proceedings (pp. 2585-2593). CEUR-WS.

Improving Irony and Stereotype Spreaders Detection using Data Augmentation and Convolutional Neural Network

Mangione S.^Secondo;Siino M.^Primo;Garbo G.^Ultimo

2022-01-01

Abstract

In this paper we describe a deep learning model based on a Data Augmentation (DA) layer followed by a Convolutional Neural Network (CNN). The proposed model was developed by our team for the Profiling Irony and Stereotype Spreaders (ISSs) task proposed by the PAN 2022 organizers. As a first step, to classify an author as ISS or not (nISS), we developed a DA layer that expands each sample in the dataset provided. Using this augmented dataset we trained the CNN. Then, to submit our predictions, we apply our DA layer on the samples within the unlabeled test set too. Finally we fed our trained CNN with the augmented test set to generate our final predictions. To develop and test our model we used a 5-fold cross validation on the labelled training set. The proposed model reaches a maximum accuracy of 0.92 and an average accuracy of 0.89 over the five folds. Meanwhile, on the provided test set the proposed model reaches an accuracy of 0.9278.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data
	
				2022
			
	URL dell'editore (Open access ove possibile)
	
				http://ceur-ws.org/Vol-3180/paper-213.pdf
			
	Citazione
	
				Mangione S.,  Siino M.,  Garbo G. (2022). Improving Irony and Stereotype Spreaders Detection using Data Augmentation and Convolutional Neural Network. In CEUR Workshop Proceedings (pp. 2585-2593). CEUR-WS.
			
	Appare nelle tipologie:
	
				2.07 Contributo in atti di convegno pubblicato in volume

File in questo prodotto:

File	Dimensione	Formato
paper-213.pdf accesso aperto Tipologia: Versione Editoriale Dimensione 1.06 MB Formato Adobe PDF Visualizza/Apri	1.06 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/567984

Citazioni

ND

22

ND

social impact