Preference data represent a particular type of ranking data (widely used in sports, web search, social sciences), where a group of people gives their preferences over a set of alternatives. Within this framework, distance-based decision trees represent a non-parametric tool for identifying the profiles of subjects giving a similar ranking. This paper aims at detecting, in the framework of (complete and incomplete) ranking data, the impact of the differently structured weighted distances for building decision trees. The traditional metrics between rankings don’t take into account the importance of swapping elements similar among them (element weights) or elements belonging to the top (or to the bottom) of an ordering (position weights). By means of simulations, using weighted distances to build decision trees, we will compute the impact of different weighting structures both on splitting and on consensus ranking. The distances that will be used satisfy Kemenys axioms and, accordingly, a modified version of the rank correlation coefficient τx, proposed by Edmond and Mason, will be proposed and used for assessing the trees’ goodness.

Plaia Antonella, Sciandra Mariangela, Buscemi Simona (2018). Weighted and unweighted distances based decision tree for ranking data. In Book of short Papers SIS 2018 (pp. 1-7). Pearson.

Weighted and unweighted distances based decision tree for ranking data

Plaia Antonella;Sciandra Mariangela;Buscemi Simona
2018-01-01

Abstract

Preference data represent a particular type of ranking data (widely used in sports, web search, social sciences), where a group of people gives their preferences over a set of alternatives. Within this framework, distance-based decision trees represent a non-parametric tool for identifying the profiles of subjects giving a similar ranking. This paper aims at detecting, in the framework of (complete and incomplete) ranking data, the impact of the differently structured weighted distances for building decision trees. The traditional metrics between rankings don’t take into account the importance of swapping elements similar among them (element weights) or elements belonging to the top (or to the bottom) of an ordering (position weights). By means of simulations, using weighted distances to build decision trees, we will compute the impact of different weighting structures both on splitting and on consensus ranking. The distances that will be used satisfy Kemenys axioms and, accordingly, a modified version of the rank correlation coefficient τx, proposed by Edmond and Mason, will be proposed and used for assessing the trees’ goodness.
2018
Settore SECS-S/01 - Statistica
978-8-8919-1023-3
Plaia Antonella, Sciandra Mariangela, Buscemi Simona (2018). Weighted and unweighted distances based decision tree for ranking data. In Book of short Papers SIS 2018 (pp. 1-7). Pearson.
File in questo prodotto:
File Dimensione Formato  
SIS 2018 Plaia Buscemi Sciandra ISBN 9788891910233.pdf

accesso aperto

Descrizione: Articolo principale
Dimensione 383.51 kB
Formato Adobe PDF
383.51 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/298188
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact