Archivio istituzionale della ricerca dell'Università degli Studi di Palermo

Irony and sarcasm are two complex linguistic phenomena that are widely used in everyday language and especially over the social media, but they represent two serious issues for automated text understanding. Many labeled corpora have been extracted from several sources to accomplish this task, and it seems that sarcasm is conveyed in different ways for different domains. Nonetheless, very little work has been done for comparing different methods among the available corpora. Furthermore, usually, each author collects and uses their own datasets to evaluate his own method. In this paper, we show that sarcasm detection can be tackled by applying classical machine-learning algorithms to input texts sub-symbolically represented in a Latent Semantic space. The main consequence is that our studies establish both reference datasets and baselines for the sarcasm detection problem that could serve the scientific community to test newly proposed methods

Di Gangi, M.A., Lo Bosco, G., Pilato, G. (2019). Effectiveness of data-driven induction of semantic spaces and traditional classifiers for sarcasm detection. NATURAL LANGUAGE ENGINEERING, 25(2), 257-285 [10.1017/S1351324919000019].

Effectiveness of data-driven induction of semantic spaces and traditional classifiers for sarcasm detection

Di Gangi, Mattia Antonino;Lo Bosco, Giosué;Pilato, Giovanni

2019-01-01

Abstract

Irony and sarcasm are two complex linguistic phenomena that are widely used in everyday language and especially over the social media, but they represent two serious issues for automated text understanding. Many labeled corpora have been extracted from several sources to accomplish this task, and it seems that sarcasm is conveyed in different ways for different domains. Nonetheless, very little work has been done for comparing different methods among the available corpora. Furthermore, usually, each author collects and uses their own datasets to evaluate his own method. In this paper, we show that sarcasm detection can be tackled by applying classical machine-learning algorithms to input texts sub-symbolically represented in a Latent Semantic space. The main consequence is that our studies establish both reference datasets and baselines for the sarcasm detection problem that could serve the scientific community to test newly proposed methods

Scheda breve

Scheda completa

Scheda completa (DC)

	Data
	
			2019
		
	Settore scientifico disciplinare del contributo
	
			Settore INF/01 - Informatica
		
	Titolo del periodico 
DATO PREVISTO SU LOGINMIUR
	
			NATURAL LANGUAGE ENGINEERING
		
	DOI del contributo 
DATO PREVISTO SU LOGINMIUR
	
			https://dx.doi.org/10.1017/S1351324919000019
		
	Citazione
	
			Di Gangi, M.A., Lo Bosco, G., Pilato, G. (2019). Effectiveness of data-driven induction of semantic spaces and traditional classifiers for sarcasm detection. NATURAL LANGUAGE ENGINEERING, 25(2), 257-285 [10.1017/S1351324919000019].
		
	Appare nelle tipologie:
	
			1.01 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
effectiveness_of_datadriven_induction_of_semantic_spaces_and_traditional_classifiers_for_sarcasm_detection.pdf Solo gestori archvio Tipologia: Versione Editoriale Dimensione 620.97 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	620.97 kB	Adobe PDF	Visualizza/Apri Richiedi una copia
post_print_effectiveness_of_datadriven_induction_of_semantic_spaces_and_traditional_classifiers_for_sarcasm_detection.pdf Open Access dal 04/07/2019 Tipologia: Post-print Dimensione 686.27 kB Formato Adobe PDF Visualizza/Apri	686.27 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/349361

Citazioni

ND

14

13

social impact