Over the last 40 years, the automatic analysis of collections of text documents has been one of the most attractive challenges in the field of information retrieval. More recently, the focus has moved towards dynamic, distributed environments, where documents are continuously created by the users of a virtual community, i.e., a social network. In the case of Twitter, such documents, called tweets, are usually related to events that involve many people in different parts of the world. In this work we present a system for real-time Twitter data analysis that makes it possible to follow a generic event from the user's point of view. The topic detection algorithm we propose is an improved version of the Soft Frequent Pattern Mining algorithm, designed to deal with dynamic environments. In particular, in order to obtain prompt results, the whole Twitter stream is split into dynamic windows whose size depends on both the volume of tweets and time. Moreover, the set of terms we use to query Twitter is progressively refined to include new relevant keywords that signal the emergence of new subtopics or new trends in the main topic. Tests were performed to evaluate the performance of the framework, and the experimental results show the effectiveness of our solution.
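As an illustration of the dynamic windows mentioned in the abstract, the following is a minimal sketch, not the authors' implementation. The abstract only states that window size depends on both tweet volume and time; the specific closing rule used here (close a window once it holds `max_tweets` tweets or spans `max_seconds` seconds, whichever comes first) is an assumption, as are the names `dynamic_windows`, `max_tweets`, and `max_seconds`.

```python
from typing import Iterable, Iterator, List, Tuple

# A tweet is modelled as (timestamp in seconds, text); this shape is an assumption.
Tweet = Tuple[float, str]


def dynamic_windows(
    stream: Iterable[Tweet],
    max_tweets: int = 3,        # hypothetical volume threshold
    max_seconds: float = 60.0,  # hypothetical time threshold
) -> Iterator[List[Tweet]]:
    """Split a time-ordered tweet stream into consecutive windows.

    A window is closed when it already holds `max_tweets` tweets or when the
    next tweet arrives `max_seconds` or more after the window's first tweet.
    """
    window: List[Tweet] = []
    window_start = 0.0
    for ts, text in stream:
        # Close the current window if either threshold has been reached.
        if window and (len(window) >= max_tweets or ts - window_start >= max_seconds):
            yield window
            window = []
        # The first tweet of a new window sets its start time.
        if not window:
            window_start = ts
        window.append((ts, text))
    # Emit whatever is left when the stream ends.
    if window:
        yield window


stream = [(0, "a"), (1, "b"), (2, "c"), (3, "d"), (100, "e"), (101, "f")]
windows = [[text for _, text in w] for w in dynamic_windows(stream)]
print(windows)
```

With these thresholds, a burst of tweets closes a window by volume, while a quiet period closes it by elapsed time, so each window could then be passed to a topic detection step without waiting for a fixed interval.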
Morana, M., Lo Re, G., Gaglio, S. (2015). Real-Time Detection of Twitter Social Events from the User's Perspective. In 2015 IEEE International Conference on Communications (ICC) (pp. 1207-1212). 345 E 47TH ST, NEW YORK, NY 10017 USA : IEEE [10.1109/ICC.2015.7248487].
Real-Time Detection of Twitter Social Events from the User's Perspective
MORANA, Marco; LO RE, Giuseppe; GAGLIO, Salvatore
2015-01-01
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.