Archivio istituzionale della ricerca dell'Università degli Studi di Palermo

DNA sequences are the basic data type that is processed to perform a generic study of biological data analysis. One key component of the biological analysis is represented by sequence classiﬁcation, a methodology that is widely used to analyze sequential data of different nature. However, its application to DNA sequences requires a proper representation of such sequences, which is still an open research problem. Machine Learning (ML) methodologies have given a fundamental contribution to the solution of the problem. Among them, recently, also Deep Neural Network (DNN) models have shown strongly encouraging results. In this chapter, we deal with speciﬁc classiﬁcation problems related to two biological scenarios: (A) metagenomics and (B) chromatin organization. The investigations have been carried out by considering DNA sequences as input data for the classiﬁca-tion methodologies. In particular, we study and test the efﬁcacy of (1) different DNA sequence representations and (2) several Deep Learning (DL) architectures that process sequences for the solution of the related supervised classiﬁcation problems. Although developed for speciﬁc classiﬁcation tasks, we think that such architectures could be served as a suggestion for developing other DNN models that process the same kind of input.

Amato, D., Di Gangi, M.A., Fiannaca, A., La Paglia, L., La Rosa, M., Lo Bosco, G., et al. (2021). Classification of Sequences with Deep Artificial Neural Networks: Representation and Architectural Issues. In M. Elloumi (a cura di), Deep Learning for Biomedical Data Analysis (pp. 27-59) [10.1007/978-3-030-71676-9_2].

Classification of Sequences with Deep Artificial Neural Networks: Representation and Architectural Issues

Amato, Domenico;Di Gangi, Mattia Antonino;Fiannaca, Antonino;La Paglia, Laura;La Rosa, Massimo;Lo Bosco, Giosué;Rizzo, Riccardo;Urso, Alfonso

2021-07-01

Abstract

DNA sequences are the basic data type that is processed to perform a generic study of biological data analysis. One key component of the biological analysis is represented by sequence classiﬁcation, a methodology that is widely used to analyze sequential data of different nature. However, its application to DNA sequences requires a proper representation of such sequences, which is still an open research problem. Machine Learning (ML) methodologies have given a fundamental contribution to the solution of the problem. Among them, recently, also Deep Neural Network (DNN) models have shown strongly encouraging results. In this chapter, we deal with speciﬁc classiﬁcation problems related to two biological scenarios: (A) metagenomics and (B) chromatin organization. The investigations have been carried out by considering DNA sequences as input data for the classiﬁca-tion methodologies. In particular, we study and test the efﬁcacy of (1) different DNA sequence representations and (2) several Deep Learning (DL) architectures that process sequences for the solution of the related supervised classiﬁcation problems. Although developed for speciﬁc classiﬁcation tasks, we think that such architectures could be served as a suggestion for developing other DNN models that process the same kind of input.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data
	
				lug-2021
			
	DOI del contributo 
DATO PREVISTO SU LOGINMIUR
	
				https://dx.doi.org/10.1007/978-3-030-71676-9_2
			
	URL dell'editore (Open access ove possibile)
	
				https://www.springer.com/gp/book/9783030716752
			
	Citazione
	
				Amato, D., Di Gangi, M.A., Fiannaca, A., La Paglia, L., La Rosa, M., Lo Bosco, G., et al. (2021). Classification of Sequences with Deep Artificial Neural Networks: Representation and Architectural Issues. In M. Elloumi (a cura di), Deep Learning for Biomedical Data Analysis (pp. 27-59) [10.1007/978-3-030-71676-9_2].
			
	Appare nelle tipologie:
	
				2.01 Capitolo o Saggio

File in questo prodotto:

File	Dimensione	Formato
Amato_et_al.pdf Solo gestori archvio Tipologia: Versione Editoriale Dimensione 684.81 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	684.81 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/515568

Citazioni

ND

8

ND

social impact