Automatic Text Complexity Evaluation (ATE) is a natural language processing task which aims to assess texts difficulty taking into account many facets related to complexity. A large number of papers tackle the problem of ATE by means of machine learning algorithms in order to classify texts into complex or simple classes. In this paper, we try to go beyond the methodologies presented so far by introducing a preliminary system based on a deep neural network model whose objective is to classify sentences into more of two classes. Experiments have been carried out on a manually annotated corpus which has been preprocessed in order to make it suitable for the scope of the paper. The results show that a higher detail level of the classification makes the ATE problem much harder to resolve, showing the weaknesses of the model to accomplish the task correctly.
Cuzzocrea, A., Lo Bosco, G., Pilato, G., Schicchi, D. (2019). Multi-class Text Complexity Evaluation via Deep Neural Networks. In H. Yin, D. Camacho, P. Tino, A.J. Tallón-Ballesteros, R. Menezes, R. Allmendinger (a cura di), Intelligent Data Engineering and Automated Learning – IDEAL 2019, 20th International Conference Manchester, UK, November 14–16, 2019 Proceedings, Part II (pp. 313-322) [10.1007/978-3-030-33617-2_32].
Multi-class Text Complexity Evaluation via Deep Neural Networks
Lo Bosco, Giosué;Pilato, Giovanni;Schicchi, Daniele
2019-01-01
Abstract
Automatic Text Complexity Evaluation (ATE) is a natural language processing task which aims to assess texts difficulty taking into account many facets related to complexity. A large number of papers tackle the problem of ATE by means of machine learning algorithms in order to classify texts into complex or simple classes. In this paper, we try to go beyond the methodologies presented so far by introducing a preliminary system based on a deep neural network model whose objective is to classify sentences into more of two classes. Experiments have been carried out on a manually annotated corpus which has been preprocessed in order to make it suitable for the scope of the paper. The results show that a higher detail level of the classification makes the ATE problem much harder to resolve, showing the weaknesses of the model to accomplish the task correctly.File | Dimensione | Formato | |
---|---|---|---|
Lo_Bosco_SChicci_IDEAL.pdf
Solo gestori archvio
Tipologia:
Versione Editoriale
Dimensione
136.51 kB
Formato
Adobe PDF
|
136.51 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.