In a clickstream analysis setting, Mixture Hidden Markov Models (MHMMs) can be used to examine categorical sequences assuming they evolve according to a mixture of latent Markov processes, each related to a different subpopulation. These models involve identifying both the number of subpopulations and hidden states. This study proposes a model selection criterion based on an integrated completed likelihood approach that accounts for the two latent classes in the model.We implemented a Monte Carlo simulation study to compare selection criteria performance. In scenarios characterised by categorical short length sequences, our proposed measure outperforms the most commonly used model selection criteria in identifying components and states. The paper presents a case study on clickstream data collected from the website of a company operating in the hospitality industry and modelled by an MHMM selected by the proposed score.

Urso, F., Abbruzzo, A., Chiodi, M., Cracolici, M.F. (2024). Model selection for mixture hidden Markov models: an application to clickstream data. STATISTICAL PAPERS [10.1007/s00362-024-01608-3].

Model selection for mixture hidden Markov models: an application to clickstream data

Urso, Furio;Abbruzzo, Antonino
;
Chiodi, Marcello;Cracolici, Maria Francesca
2024-10-19

Abstract

In a clickstream analysis setting, Mixture Hidden Markov Models (MHMMs) can be used to examine categorical sequences assuming they evolve according to a mixture of latent Markov processes, each related to a different subpopulation. These models involve identifying both the number of subpopulations and hidden states. This study proposes a model selection criterion based on an integrated completed likelihood approach that accounts for the two latent classes in the model.We implemented a Monte Carlo simulation study to compare selection criteria performance. In scenarios characterised by categorical short length sequences, our proposed measure outperforms the most commonly used model selection criteria in identifying components and states. The paper presents a case study on clickstream data collected from the website of a company operating in the hospitality industry and modelled by an MHMM selected by the proposed score.
19-ott-2024
Urso, F., Abbruzzo, A., Chiodi, M., Cracolici, M.F. (2024). Model selection for mixture hidden Markov models: an application to clickstream data. STATISTICAL PAPERS [10.1007/s00362-024-01608-3].
File in questo prodotto:
File Dimensione Formato  
Urso_Abbruzzo_Chiodi_Cracolici.pdf

accesso aperto

Descrizione: Manuscript
Tipologia: Versione Editoriale
Dimensione 724.15 kB
Formato Adobe PDF
724.15 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/661744
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact