Archivio istituzionale della ricerca dell'Università degli Studi di Palermo

Data clustering algorithms represent mechanisms for partitioning huge arrays of multidimensional data into groups with small in–group and large out–group distances. Most of the existing algorithms fail when a lower bound for the distance among cluster centroids is specified, while this type of constraint can be of help in obtaining a better clustering. Traditional approaches require that the desired number of clusters are specified a priori, which requires either a subjective decision or global meta–information knowledge that is not easily obtainable. In this paper, an extension of the standard data clustering problem is addressed, including additional constraints on the cluster centroid distances. Based on the well–known Hegelsmann–Krause opinion dynamics model, an algorithm that is capable to find admissible solutions is given. A key feature of the algorithm is the ability to partition the original set of data into a suitable number of clusters, without the necessity to specify such a number in advance. In the proposed approach, instead, the maximum distance among any pair of cluster centroids is specified.

Oliva, G., La Manna, D., Fagiolini, A., Setola, R. (2014). Distance–Constrained Data Clustering by Combined k–means Algorithms and Opinion Dynamics Filters. In Proceedings of Mediterranean Conference on Control and Automation [10.1109/MED.2014.6961441].

Distance–Constrained Data Clustering by Combined k–means Algorithms and Opinion Dynamics Filters

Oliva, G;LA MANNA, Damiano;FAGIOLINI, Adriano;Setola, R.

2014-01-01

Abstract

Data clustering algorithms represent mechanisms for partitioning huge arrays of multidimensional data into groups with small in–group and large out–group distances. Most of the existing algorithms fail when a lower bound for the distance among cluster centroids is specified, while this type of constraint can be of help in obtaining a better clustering. Traditional approaches require that the desired number of clusters are specified a priori, which requires either a subjective decision or global meta–information knowledge that is not easily obtainable. In this paper, an extension of the standard data clustering problem is addressed, including additional constraints on the cluster centroid distances. Based on the well–known Hegelsmann–Krause opinion dynamics model, an algorithm that is capable to find admissible solutions is given. A key feature of the algorithm is the ability to partition the original set of data into a suitable number of clusters, without the necessity to specify such a number in advance. In the proposed approach, instead, the maximum distance among any pair of cluster centroids is specified.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data
	
				2014
			
	ISBN della monografia 
DATO PREVISTO SU LOGINMIUR
	
				978-1-4799-5901-3
			
	DOI del contributo 
DATO PREVISTO SU LOGINMIUR
	
				https://dx.doi.org/10.1109/MED.2014.6961441
			
	URL alternativo rispetto a quello dell'editore 
DATO PREVISTO SU LOGINMIUR
	
				http://ieeexplore.ieee.org/document/6961441/
			
	Citazione
	
				Oliva, G., La Manna, D., Fagiolini, A., Setola, R. (2014). Distance–Constrained Data Clustering by Combined k–means Algorithms and Opinion Dynamics Filters. In Proceedings of Mediterranean Conference on Control and Automation [10.1109/MED.2014.6961441].
			
	Appare nelle tipologie:
	
				2.07 Contributo in atti di convegno pubblicato in volume

File in questo prodotto:

File	Dimensione	Formato
06961441.pdf Solo gestori archvio Tipologia: Versione Editoriale Dimensione 1.13 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.13 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/98017

Citazioni

ND

2

1

social impact