This paper proposes a data science approach based on Benford's Law to analyse tourist flows – being tourism a relevant economic sector in Sicily. In particular, we are interested in detecting irregular patterns in the numerical data that may represent manipulations, inaccuracies or biases in the self-reported data from tourism organisations. The analysis is carried out by using monthly data for arrivals and overnight stays in hotels, B&Bs, and complementary accommodations in the seven provinces of the island from January 2016 to December 2019. We perform the analysis by employing several statistical tests and through a visual inspection of the difference between the empirical distributions and the theoretical Benford's. Conformity to Benford's distribution is mostly confirmed for the total number of overnight stays and the data considered on a yearly basis. On the contrary, we found evident deviations from Benford's Law in the empirical distribution of data broken down by nationality of tourist and accommodation type. Some comments on possible motivations for such deviations are also advanced, even though a detailed exploration of them deserves a devoted study.
Roy Cerqueti, Davide Provenzano (2023). Benford's Law for economic data reliability: The case of tourism flows in Sicily. CHAOS, SOLITONS AND FRACTALS, 173 [10.1016/j.chaos.2023.113635].
Benford's Law for economic data reliability: The case of tourism flows in Sicily
Davide Provenzano
2023-06-01
Abstract
This paper proposes a data science approach based on Benford's Law to analyse tourist flows – being tourism a relevant economic sector in Sicily. In particular, we are interested in detecting irregular patterns in the numerical data that may represent manipulations, inaccuracies or biases in the self-reported data from tourism organisations. The analysis is carried out by using monthly data for arrivals and overnight stays in hotels, B&Bs, and complementary accommodations in the seven provinces of the island from January 2016 to December 2019. We perform the analysis by employing several statistical tests and through a visual inspection of the difference between the empirical distributions and the theoretical Benford's. Conformity to Benford's distribution is mostly confirmed for the total number of overnight stays and the data considered on a yearly basis. On the contrary, we found evident deviations from Benford's Law in the empirical distribution of data broken down by nationality of tourist and accommodation type. Some comments on possible motivations for such deviations are also advanced, even though a detailed exploration of them deserves a devoted study.File | Dimensione | Formato | |
---|---|---|---|
CHAOS_113635.pdf
Solo gestori archvio
Tipologia:
Post-print
Dimensione
935.62 kB
Formato
Adobe PDF
|
935.62 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
1-s2.0-S0960077923005362-main.pdf
Solo gestori archvio
Tipologia:
Versione Editoriale
Dimensione
1.73 MB
Formato
Adobe PDF
|
1.73 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.