Preference data represent a particular type of ranking data (widely used in sports, web search, social sciences), where a group of people gives their preferences over a set of alternatives. Within this framework, distance-based decision trees represent a non-parametric tool for identifying the profiles of subjects giving a similar ranking. This paper aims at detecting, in the framework of (complete and incomplete) ranking data, the impact of the differently structured weighted distances for building decision trees. The traditional metrics between rankings don’t take into account the importance of swapping elements similar among them (element weights) or elements belonging to the top (or to the bottom) of an ordering (position weights). By means of simulations, using weighted distances to build decision trees, we will compute the impact of different weighting structures both on splitting and on consensus ranking. The distances that will be used satisfy Kemenys axioms and, accordingly, a modified version of the rank correlation coefficient τx, proposed by Edmond and Mason, will be proposed and used for assessing the trees’ goodness.
Plaia Antonella, Sciandra Mariangela, Buscemi Simona (2018). Weighted and unweighted distances based decision tree for ranking data. In Book of short Papers SIS 2018 (pp. 1-7). Pearson.
Weighted and unweighted distances based decision tree for ranking data
Plaia Antonella;Sciandra Mariangela;Buscemi Simona
2018-01-01
Abstract
Preference data represent a particular type of ranking data (widely used in sports, web search, social sciences), where a group of people gives their preferences over a set of alternatives. Within this framework, distance-based decision trees represent a non-parametric tool for identifying the profiles of subjects giving a similar ranking. This paper aims at detecting, in the framework of (complete and incomplete) ranking data, the impact of the differently structured weighted distances for building decision trees. The traditional metrics between rankings don’t take into account the importance of swapping elements similar among them (element weights) or elements belonging to the top (or to the bottom) of an ordering (position weights). By means of simulations, using weighted distances to build decision trees, we will compute the impact of different weighting structures both on splitting and on consensus ranking. The distances that will be used satisfy Kemenys axioms and, accordingly, a modified version of the rank correlation coefficient τx, proposed by Edmond and Mason, will be proposed and used for assessing the trees’ goodness.File | Dimensione | Formato | |
---|---|---|---|
SIS 2018 Plaia Buscemi Sciandra ISBN 9788891910233.pdf
accesso aperto
Descrizione: Articolo principale
Dimensione
383.51 kB
Formato
Adobe PDF
|
383.51 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.