This paper proposes a data science approach based on Benford's Law to analyse tourist flows in Sicily, where tourism is a relevant economic sector. In particular, we are interested in detecting irregular patterns in the numerical data that may indicate manipulations, inaccuracies or biases in the data self-reported by tourism organisations. The analysis uses monthly data on arrivals and overnight stays in hotels, B&Bs, and complementary accommodations in the nine provinces of the island from January 2016 to December 2019. We apply several statistical tests and visually inspect the difference between the empirical distributions and the theoretical Benford distribution. Conformity to Benford's distribution is mostly confirmed for the total number of overnight stays and for the data considered on a yearly basis. By contrast, we find evident deviations from Benford's Law in the empirical distribution of data broken down by tourist nationality and accommodation type. Some comments on possible reasons for such deviations are also offered, although a detailed exploration of them deserves a dedicated study.
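As a rough illustration of the kind of check described in the abstract, the sketch below compares the first-digit frequencies of a set of counts with Benford's expected frequencies, log10(1 + 1/d) for d = 1, ..., 9, and computes a Pearson chi-square statistic. The abstract does not name the specific tests used in the paper, so the chi-square test is an assumption here, and all function names and sample values are hypothetical.

```python
import math
from collections import Counter


def benford_expected():
    """Benford's first-digit probabilities: P(d) = log10(1 + 1/d), d = 1..9."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def first_digit(n):
    """Leading decimal digit of a non-zero integer count."""
    n = abs(int(n))
    if n == 0:
        raise ValueError("zero has no leading significant digit")
    while n >= 10:
        n //= 10
    return n


def first_digit_frequencies(values):
    """Relative frequency of each leading digit, plus the sample size.

    Zero counts are skipped because they carry no leading digit.
    """
    digits = [first_digit(v) for v in values if int(v) != 0]
    if not digits:
        raise ValueError("no non-zero values to analyse")
    counts = Counter(digits)
    total = len(digits)
    return {d: counts.get(d, 0) / total for d in range(1, 10)}, total


def chi_square_statistic(values):
    """Pearson chi-square distance between observed and Benford frequencies.

    Assumption: this is one common conformity test, not necessarily
    one of the tests used in the paper.
    """
    observed, n = first_digit_frequencies(values)
    expected = benford_expected()
    return n * sum(
        (observed[d] - expected[d]) ** 2 / expected[d] for d in range(1, 10)
    )


# Hypothetical samples (not data from the paper):
# powers of two are a classic Benford-conforming sequence,
# while the integers 1..999 have uniformly distributed leading digits.
conforming = [2 ** k for k in range(1, 201)]
uniform = list(range(1, 1000))

# 15.507 is the 5% critical value of the chi-square distribution with 8 d.o.f.
CRITICAL_5_PERCENT = 15.507
print(chi_square_statistic(conforming) < CRITICAL_5_PERCENT)
print(chi_square_statistic(uniform) < CRITICAL_5_PERCENT)
```

With monthly arrivals or overnight stays in place of the hypothetical samples, a statistic above the critical value would flag a deviation from Benford's Law at the 5% level. A deviation on its own does not establish manipulation.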

Roy Cerqueti, Davide Provenzano (2023). Benford's Law for economic data reliability: The case of tourism flows in Sicily. CHAOS, SOLITONS AND FRACTALS [10.1016/j.chaos.2023.113635].

Benford's Law for economic data reliability: The case of tourism flows in Sicily

Davide Provenzano
2023-06-01

June 2023
Sector SECS-S/06 - Mathematical Methods of Economics and of Actuarial and Financial Sciences

Use this identifier to cite or link to this document: https://hdl.handle.net/10447/594040