In the framework of preference rankings, the interest can lie in finding which predictors and which interactions are able to explain the observed preference structures. The last years have seen a remarkable owering of works about the use of decision tree for clustering preference vectors. As a matter of fact, decision trees are useful and intuitive, but they are very unstable: small perturbations bring big changes. This is the reason why it could be necessary to use more stable procedures in order to clustering ranking data. In this work, following the idea of Bolton (2003), a Projection Pursuit (PP) clustering algorithm for preference data will be proposed in order to extract useful information in a low-dimensional subspace by starting from a high but most empty dimensional space. Projection pursuit clustering is a synthesis of projection pursuit and nonhierarchical clustering methods that simultaneously attempts to cluster the data and to find a low-dimensional representation of this cluster structure. As introduced by Huber (1985), a PP algorithm consists of two components: an index function I(α) that measures the "usefulness" of projection and a search algorithm that varies the projection direction so as to find the optimal projections, given the index function I(α) and the data set X. In this work a proper specified Projection index function for discrete data will be defined: several distances will be used to evaluate distances between the density of the projected data and the uninteresting uniform density. We also propose diagnostics for finding the optimum number of clusters in projection pursuit clustering. All the methodology is illustrated and evaluated on one simulated and one real dataset.

Sciandra Mariangela, D.A. (2018). A Projection Pursuit Algorithm for Preference Data. In Book of Abstracts SMTDA 2018 (pp. 1-1). ISAST.

A Projection Pursuit Algorithm for Preference Data

Sciandra Mariangela
;
Plaia Antonella
2018-01-01

Abstract

In the framework of preference rankings, the interest can lie in finding which predictors and which interactions are able to explain the observed preference structures. The last years have seen a remarkable owering of works about the use of decision tree for clustering preference vectors. As a matter of fact, decision trees are useful and intuitive, but they are very unstable: small perturbations bring big changes. This is the reason why it could be necessary to use more stable procedures in order to clustering ranking data. In this work, following the idea of Bolton (2003), a Projection Pursuit (PP) clustering algorithm for preference data will be proposed in order to extract useful information in a low-dimensional subspace by starting from a high but most empty dimensional space. Projection pursuit clustering is a synthesis of projection pursuit and nonhierarchical clustering methods that simultaneously attempts to cluster the data and to find a low-dimensional representation of this cluster structure. As introduced by Huber (1985), a PP algorithm consists of two components: an index function I(α) that measures the "usefulness" of projection and a search algorithm that varies the projection direction so as to find the optimal projections, given the index function I(α) and the data set X. In this work a proper specified Projection index function for discrete data will be defined: several distances will be used to evaluate distances between the density of the projected data and the uninteresting uniform density. We also propose diagnostics for finding the optimum number of clusters in projection pursuit clustering. All the methodology is illustrated and evaluated on one simulated and one real dataset.
2018
Projetion pursuit, preference data, Clustering rankings
978-618-5180-29-4
Sciandra Mariangela, D.A. (2018). A Projection Pursuit Algorithm for Preference Data. In Book of Abstracts SMTDA 2018 (pp. 1-1). ISAST.
File in questo prodotto:
File Dimensione Formato  
Book_of_Abstracts_SMTDA2018.pdf

accesso aperto

Dimensione 1.18 MB
Formato Adobe PDF
1.18 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/298599
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact