Institutional Research Archive of the Università degli Studi di Palermo

Cattaneo, G., Petrillo, U., Giancarlo, R., Roscigno, G. (2015). Alignment-Free Sequence Comparison over Hadoop for Computational Biology. In Proceedings of the International Conference on Parallel Processing Workshops (pp. 184-192). Institute of Electrical and Electronics Engineers Inc. [10.1109/ICPPW.2015.28].

Alignment-Free Sequence Comparison over Hadoop for Computational Biology

Cattaneo, G.; Petrillo, U.; Giancarlo, Raffaele; Roscigno, G.

2015-01-01

Abstract

Sequence comparison, i.e., the assessment of how similar two biological sequences are to each other, is a fundamental and routine task in Computational Biology and Bioinformatics. Classically, alignment methods are the de facto standard for such an assessment. Indeed, considerable research effort has gone into developing efficient alignment algorithms, on both classic and parallel architectures, over the past 50 years. Due to the growing amount of sequence data being produced, a new class of methods has emerged: alignment-free methods. Research in this area has become very intense in the past few years, stimulated by the advent of Next Generation Sequencing technologies, since these new methods are very appealing in terms of the computational resources they need and their biological relevance. Despite this effort, and in contrast with sequence alignment methods, no systematic investigation of how to take advantage of distributed architectures to speed up alignment-free methods has taken place. We provide a contribution of that kind by evaluating the possibility of using the Hadoop distributed framework to speed up the running times of these methods, compared to their original sequential formulation.
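
The abstract does not say which alignment-free measures are evaluated or how they are mapped onto Hadoop. Purely as an illustration of the general idea, the following is a minimal single-process Python sketch of one common alignment-free approach: count the k-mers (length-k substrings) of each sequence in a map/reduce style, then compare the resulting count vectors. The function names, the toy sequences, the choice of k = 3 and the use of Euclidean distance are all assumptions made for this example; the code does not use the Hadoop API and is not the authors' implementation.

```python
from collections import Counter
from itertools import chain
from math import sqrt


def map_kmers(seq_id, sequence, k):
    """Map step (hypothetical): emit ((seq_id, kmer), 1) for every k-length window."""
    for i in range(len(sequence) - k + 1):
        yield (seq_id, sequence[i:i + k]), 1


def reduce_counts(pairs):
    """Reduce step (hypothetical): sum the emitted values per (seq_id, kmer) key."""
    totals = Counter()
    for key, value in pairs:
        totals[key] += value
    return totals


def euclidean_distance(totals, id_a, id_b):
    """Euclidean distance between the k-mer count vectors of two sequences."""
    kmers = {kmer for (_, kmer) in totals}
    # Counter returns 0 for k-mers that a sequence does not contain.
    return sqrt(sum((totals[(id_a, km)] - totals[(id_b, km)]) ** 2 for km in kmers))


# Toy input, invented for illustration only.
sequences = {"s1": "ACGTACGT", "s2": "ACGTTTGT"}
pairs = chain.from_iterable(map_kmers(sid, seq, 3) for sid, seq in sequences.items())
totals = reduce_counts(pairs)
distance = euclidean_distance(totals, "s1", "s2")
print(f"Euclidean k-mer distance: {distance:.3f}")
```

In this sketch the map step is independent for each sequence and the reduce step only needs the values that share a key, which is the property that makes this kind of computation a plausible candidate for a distributed framework such as Hadoop.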

Date	2015
ISBN of the monograph	9781467375894
DOI of the contribution	https://dx.doi.org/10.1109/ICPPW.2015.28
Citation	Cattaneo, G., Petrillo, U., Giancarlo, R., Roscigno, G. (2015). Alignment-Free Sequence Comparison over Hadoop for Computational Biology. In Proceedings of the International Conference on Parallel Processing Workshops (pp. 184-192). Institute of Electrical and Electronics Engineers Inc. [10.1109/ICPPW.2015.28].
Appears in types	2.07 Conference proceedings contribution published in a volume

Files in this item:

File	Size	Format
Alignment-Free_Sequence_Comparison_over_Hadoop_for_Computational_Biology.pdf (archive managers only)	152.59 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10447/201070
