Irony and sarcasm are two complex linguistic phenomena that are widely used in everyday language, especially on social media, and they pose serious problems for automated text understanding. Many labeled corpora have been extracted from several sources to address sarcasm detection, and sarcasm appears to be conveyed in different ways in different domains. Nonetheless, very little work has been done to compare different methods across the available corpora. Furthermore, each author usually collects and uses their own datasets to evaluate their own method. In this paper, we show that sarcasm detection can be tackled by applying classical machine-learning algorithms to input texts sub-symbolically represented in a Latent Semantic space. The main consequence is that our studies establish both reference datasets and baselines for the sarcasm detection problem that could serve the scientific community to test newly proposed methods.
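The approach named in the abstract, classical classifiers applied to texts represented in a Latent Semantic space, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the toy sentences and labels are invented, the helper names are hypothetical, the latent space is approximated by power iteration on a raw count matrix, and a nearest-centroid rule stands in for whichever classical classifiers the authors actually evaluated.

```python
import math
import random
from collections import Counter


def vectorise(text, index):
    """Turn a text into a term-count vector; unknown words are ignored."""
    row = [0.0] * len(index)
    for word in text.lower().split():
        if word in index:
            row[index[word]] += 1.0
    return row


def bag_of_words(texts):
    """Build the document-term count matrix and the word-to-column index."""
    vocab = sorted({w for t in texts for w in t.lower().split()})
    index = {w: j for j, w in enumerate(vocab)}
    return [vectorise(t, index) for t in texts], index


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def top_right_singular_vectors(A, k, iters=200, seed=0):
    """Approximate the top-k right singular vectors of A by power iteration
    on A^T A, re-orthogonalising against the vectors already found."""
    rng = random.Random(seed)
    n_terms = len(A[0])
    basis = []
    for _ in range(k):
        v = [rng.uniform(-1.0, 1.0) for _ in range(n_terms)]
        for _ in range(iters):
            Av = [dot(row, v) for row in A]
            # w = A^T (A v), computed without forming A^T A explicitly
            w = [sum(A[i][j] * Av[i] for i in range(len(A))) for j in range(n_terms)]
            for b in basis:  # Gram-Schmidt step against earlier vectors
                p = dot(w, b)
                w = [x - p * y for x, y in zip(w, b)]
            norm = math.sqrt(dot(w, w))
            if norm == 0.0:
                raise ValueError("k exceeds the rank of the count matrix")
            v = [x / norm for x in w]
        basis.append(v)
    return basis


def project(row, basis):
    """Map a term-count vector to its coordinates in the latent space."""
    return [dot(row, b) for b in basis]


def fit_centroids(vectors, labels):
    """Average the latent vectors of each class."""
    counts = Counter(labels)
    sums = {}
    for vec, y in zip(vectors, labels):
        acc = sums.setdefault(y, [0.0] * len(vec))
        for j, x in enumerate(vec):
            acc[j] += x
    return {y: [x / counts[y] for x in acc] for y, acc in sums.items()}


def predict(vec, centroids):
    """Return the label whose centroid is closest in squared distance."""
    return min(centroids,
               key=lambda y: sum((a - b) ** 2 for a, b in zip(vec, centroids[y])))


# Invented toy data, for illustration only.
texts = [
    "oh great another monday morning meeting",
    "wow i just love waiting in traffic",
    "oh great the printer broke again",
    "the meeting starts at nine on monday",
    "traffic was light this morning",
    "the printer was repaired yesterday",
]
labels = ["sarcastic", "sarcastic", "sarcastic", "literal", "literal", "literal"]

A, index = bag_of_words(texts)
basis = top_right_singular_vectors(A, k=2)
latent = [project(row, basis) for row in A]
centroids = fit_centroids(latent, labels)

new_vec = project(vectorise("oh great another delay", index), basis)
print(predict(new_vec, centroids))
```

On six sentences the prediction itself carries no weight; the sketch only shows the shape of the pipeline: count words, project onto a few latent dimensions, then hand the low-dimensional vectors to an ordinary classifier.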
Di Gangi, M.A., Lo Bosco, G., Pilato, G. (2019). Effectiveness of data-driven induction of semantic spaces and traditional classifiers for sarcasm detection. Natural Language Engineering, 25(2), 257-285 [10.1017/S1351324919000019].
Effectiveness of data-driven induction of semantic spaces and traditional classifiers for sarcasm detection
Lo Bosco, Giosué; Pilato, Giovanni
2019-01-01
File | Access | Type | Size | Format |
---|---|---|---|---|
effectiveness_of_datadriven_induction_of_semantic_spaces_and_traditional_classifiers_for_sarcasm_detection.pdf | Archive managers only | Publisher's version | 620.97 kB | Adobe PDF |
post_print_effectiveness_of_datadriven_induction_of_semantic_spaces_and_traditional_classifiers_for_sarcasm_detection.pdf | Open Access from 04/07/2019 | Post-print | 686.27 kB | Adobe PDF |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.