In this paper we propose a totally unsupervised and automatic illustration method, which aims to find onto the Web a set of images to illustrate the content of an input short text. The text is modelled as a semantic space and a set of relevant keywords is extracted. We compare and discuss different methods to create semantic representations by keyword extraction. Keywords are used to query Google Image Search engine for a list of relevant images. We also extract information from the Web pages that include the retrieved images, to create an Image Semantic Space, which is compared to the Text Semantic Space in order to rank the list of retrieved images. Tests showed that our method achieves very good results, which overcome those obtained by using a state-of-the-art application. Furthermore we developed a Web tool to test our system and evaluate results within the Internet community.
Aramini, S., Ardizzone, E., Mazzola, G. (2015). Automatic Illustration of Short Texts via Web Images. In Proceedings of the 6th International Conference on Information Visualization Theory and Applications (IVAPP-2015) (pp. 139-148). SCITEPRESS [10.5220/0005307301390148].
Automatic Illustration of Short Texts via Web Images
ARAMINI, Sandro Aldo;ARDIZZONE, Edoardo;MAZZOLA, Giuseppe
2015-01-01
Abstract
In this paper we propose a totally unsupervised and automatic illustration method, which aims to find onto the Web a set of images to illustrate the content of an input short text. The text is modelled as a semantic space and a set of relevant keywords is extracted. We compare and discuss different methods to create semantic representations by keyword extraction. Keywords are used to query Google Image Search engine for a list of relevant images. We also extract information from the Web pages that include the retrieved images, to create an Image Semantic Space, which is compared to the Text Semantic Space in order to rank the list of retrieved images. Tests showed that our method achieves very good results, which overcome those obtained by using a state-of-the-art application. Furthermore we developed a Web tool to test our system and evaluate results within the Internet community.File | Dimensione | Formato | |
---|---|---|---|
53073.pdf
accesso aperto
Descrizione: Articolo principale
Tipologia:
Versione Editoriale
Dimensione
1.63 MB
Formato
Adobe PDF
|
1.63 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.