Data heterogeneity, within a (linear) regression framework, often suggests the use of a Clusterwise Linear Regression (CLR) procedure, which implies, among other things, the estimate of the appropriate number of clusters as well as the cluster membership of each unit. The approaches to the estimation of a CLR model are essentially based on the Ordinary Least Square (OLS) criterion or the likelihood criterion. In this paper, in a context of OLS approach, we propose an estimation of the model making use of an algorithm based on a threshold criterion for the determination coefficient of each cluster, to identify the appropriate number of clusters, and of a modified Spath's algorithm, to estimate the cluster membership of each sample unit. A simulation design and an application to a real data-set show that the procedure outperforms other algorithms commonly used in the literature.

Plaia, A., Bologna, S. (2009). A new OLS-based procedure for clusterwise linear regression. STATISTICA & APPLICAZIONI, VII(January-June), 45-61.

A new OLS-based procedure for clusterwise linear regression

PLAIA, Antonella;BOLOGNA, Salvatore
2009-01-01

Abstract

Data heterogeneity, within a (linear) regression framework, often suggests the use of a Clusterwise Linear Regression (CLR) procedure, which implies, among other things, the estimate of the appropriate number of clusters as well as the cluster membership of each unit. The approaches to the estimation of a CLR model are essentially based on the Ordinary Least Square (OLS) criterion or the likelihood criterion. In this paper, in a context of OLS approach, we propose an estimation of the model making use of an algorithm based on a threshold criterion for the determination coefficient of each cluster, to identify the appropriate number of clusters, and of a modified Spath's algorithm, to estimate the cluster membership of each sample unit. A simulation design and an application to a real data-set show that the procedure outperforms other algorithms commonly used in the literature.
2009
Settore SECS-S/01 - Statistica
Plaia, A., Bologna, S. (2009). A new OLS-based procedure for clusterwise linear regression. STATISTICA & APPLICAZIONI, VII(January-June), 45-61.
File in questo prodotto:
File Dimensione Formato  
plaia_bologna_S&A_finale.pdf

Solo gestori archvio

Dimensione 534.38 kB
Formato Adobe PDF
534.38 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/38507
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact