Text simplification is a non-trivial task that aims at reducing the linguistic complexity of written texts. Researchers have studied the problem by proposing new methodologies for addressing the English language, but other languages, like the Italian one, are almost unexplored. In this paper, we give a contribution to the enhancement of the Automated Text Simplification research by presenting a deep learning-based system, inspired by a state of the art system for the English language, capable of simplifying Italian texts. The system has been trained and tested by leveraging the Italian version of Newsela; it has shown promising results by achieving a SARI value of 30.17.
Megna A.L., Schicchi D., Lo Bosco G., Pilato G. (2021). A Controllable Text Simplification System for the Italian Language. In Proceedings - 2021 IEEE 15th International Conference on Semantic Computing, ICSC 2021 (pp. 191-194). Institute of Electrical and Electronics Engineers Inc. [10.1109/ICSC50631.2021.00040].
A Controllable Text Simplification System for the Italian Language
Schicchi D.
;Lo Bosco G.;Pilato G.
2021-01-01
Abstract
Text simplification is a non-trivial task that aims at reducing the linguistic complexity of written texts. Researchers have studied the problem by proposing new methodologies for addressing the English language, but other languages, like the Italian one, are almost unexplored. In this paper, we give a contribution to the enhancement of the Automated Text Simplification research by presenting a deep learning-based system, inspired by a state of the art system for the English language, capable of simplifying Italian texts. The system has been trained and tested by leveraging the Italian version of Newsela; it has shown promising results by achieving a SARI value of 30.17.File | Dimensione | Formato | |
---|---|---|---|
09364415.pdf
Solo gestori archvio
Descrizione: articolo principale
Tipologia:
Versione Editoriale
Dimensione
110.48 kB
Formato
Adobe PDF
|
110.48 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.