Archivio istituzionale della ricerca dell'Università degli Studi di Palermo

Several recent works have shown that K-mer sequence representation of a DNA sequence can be used for classification or identification of nucleosome positioning related sequences. This representation can be computationally expensive when k grows, making the complexity in spaces of exponential dimension. This issue effects significantly the classification task computed by a general machine learning algorithm used for the purpose of sequence classification. In this paper, we investigate the advantage offered by the so-called Variable Ranking Feature Selection method to select the most informative k − mers associated to a set of DNA sequences, for the final purpose of nucleosome/linker classification by a deep learning network. Results computed on three public datasets show the effectiveness of the adopted feature selection method.

Lo Bosco, G., Rizzo, R., Fiannaca, A., La Rosa, M., Urso, A. (2018). Variable Ranking Feature Selection for the Identification of Nucleosome Related Sequences. In A. Benczúr, B. Thalheim, T. Horváth, S. Chiusano, T. Cerquitelli, C. Sidló, et al. (a cura di), New Trends in Databases and Information Systems (pp. 314-324). Springer Verlag [10.1007/978-3-030-00063-9_30].

Variable Ranking Feature Selection for the Identification of Nucleosome Related Sequences

Lo Bosco, Giosué;Rizzo, Riccardo;Fiannaca, Antonino;La Rosa, Massimo;Urso, Alfonso

2018-01-01

Abstract

Several recent works have shown that K-mer sequence representation of a DNA sequence can be used for classification or identification of nucleosome positioning related sequences. This representation can be computationally expensive when k grows, making the complexity in spaces of exponential dimension. This issue effects significantly the classification task computed by a general machine learning algorithm used for the purpose of sequence classification. In this paper, we investigate the advantage offered by the so-called Variable Ranking Feature Selection method to select the most informative k − mers associated to a set of DNA sequences, for the final purpose of nucleosome/linker classification by a deep learning network. Results computed on three public datasets show the effectiveness of the adopted feature selection method.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data
	
				2018
			
	ISBN della monografia 
DATO PREVISTO SU LOGINMIUR
	
				978-3-030-00062-2
			
	DOI del contributo 
DATO PREVISTO SU LOGINMIUR
	
				https://dx.doi.org/10.1007/978-3-030-00063-9_30
			
	URL alternativo rispetto a quello dell'editore 
DATO PREVISTO SU LOGINMIUR
	
				https://link.springer.com/chapter/10.1007/978-3-030-00063-9_30
			
	Citazione
	
				Lo Bosco, G., Rizzo, R., Fiannaca, A., La Rosa, M., Urso, A. (2018). Variable Ranking Feature Selection for the Identification of Nucleosome Related Sequences. In A. Benczúr, B. Thalheim, T. Horváth, S. Chiusano, T. Cerquitelli, C. Sidló, et al. (a cura di), New Trends in Databases and Information Systems (pp. 314-324). Springer Verlag [10.1007/978-3-030-00063-9_30].
			
	Appare nelle tipologie:
	
				2.07 Contributo in atti di convegno pubblicato in volume

File in questo prodotto:

File	Dimensione	Formato
10.1007@978-3-030-00063-930.pdf Solo gestori archvio Tipologia: Versione Editoriale Dimensione 1.26 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.26 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/295999

Citazioni

ND

3

2

social impact