Comparing Boosting and Bagging for Decision Trees of Rankings

Plaia A. (first author); Buscemi S.
2022-03-01

Abstract

Decision tree learning is among the most popular and most traditional families of machine learning algorithms. While these techniques excel in being quite intuitive and interpretable, they also suffer from instability: small perturbations in the training data may result in big changes in the predictions. The so-called ensemble methods combine the output of multiple trees, which makes the decision more reliable and stable. They have been applied primarily to numeric prediction problems and to classification tasks. In recent years, some attempts to extend ensemble methods to ordinal data have appeared in the literature, but no concrete methodology has been provided for preference data. In this paper, we extend decision trees, and subsequently ensemble methods, to ranking data. In particular, we propose a theoretical and computational definition of bagging and boosting, two of the best-known ensemble methods. In an experimental study using simulated data and real-world datasets, our results confirm that known results from classification, such as boosting outperforming bagging, carry over to the ranking case.
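The abstract describes bagging only conceptually; the following minimal Python sketch is an illustration under stated assumptions, not the procedure proposed in the paper. The names fit_tree and bagged_rank_predict are hypothetical placeholders: each base tree is fit on a bootstrap resample of the training rankings, and the predicted rankings are aggregated with a simple Borda-style consensus (average the ranks, then re-rank the averages).

    # Illustrative sketch only: bagging for rankings with Borda-style aggregation.
    # fit_tree(X, R) is a hypothetical learner returning a model whose
    # .predict(X) yields rankings (rows of item ranks, 1 = most preferred).
    import numpy as np

    def bagged_rank_predict(fit_tree, X_train, R_train, X_test, n_trees=50, seed=0):
        rng = np.random.default_rng(seed)
        n = len(X_train)
        preds = []
        for _ in range(n_trees):
            idx = rng.integers(0, n, size=n)             # bootstrap resample with replacement
            model = fit_tree(X_train[idx], R_train[idx]) # fit one base tree on the resample
            preds.append(model.predict(X_test))          # (n_test, n_items) array of ranks
        avg_ranks = np.mean(preds, axis=0)               # average rank of each item per instance
        # Borda-style consensus: re-rank the averaged ranks (1 = most preferred)
        consensus = np.argsort(np.argsort(avg_ranks, axis=1), axis=1) + 1
        return consensus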
Plaia A., Buscemi S., Furnkranz J., Mencia E.L. (2022). Comparing Boosting and Bagging for Decision Trees of Rankings. JOURNAL OF CLASSIFICATION, 39, 78-99 [10.1007/s00357-021-09397-2].
Files in this record:

File: s00357-021-09397-2.pdf (open access)
Description: This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Type: Published version (Versione Editoriale)
Size: 1.4 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10447/518398
Citations
  • PubMed Central: n/a
  • Scopus: 16
  • Web of Science (ISI): 12