Bioinformatics has a long history of software solutions developed on multi-core computing systems for solving computational intensive problems. This option suffer from some issues solvable by shifting to Distributed Systems. In particular, the MapReduce computing paradigm, and its implementations, Hadoop and Spark, is becoming increasingly popular in the Bioinformatics field because it allows for virtual-unlimited horizontal scalability while being easy-to-use. Here we provide a qualitative evaluation of some of the most significant MapReduce bioinformatics applications. We also focus on one of these applications to show the importance of correctly engineering an application to fully exploit the potential of Distributed Systems.

Cattaneo, G., Giancarlo, R., Petrillo, U.F., Roscigno, G. (2017). MapReduce in Computational Biology Via Hadoop and Spark. In Reference Module in the Life Sciences. Elsevier [10.1016/B978-0-12-809633-8.20371-3].

MapReduce in Computational Biology Via Hadoop and Spark

Cattaneo, Giuseppe;Giancarlo, Raffaele;Petrillo, Umberto Ferraro;
2017-01-01

Abstract

Bioinformatics has a long history of software solutions developed on multi-core computing systems for solving computational intensive problems. This option suffer from some issues solvable by shifting to Distributed Systems. In particular, the MapReduce computing paradigm, and its implementations, Hadoop and Spark, is becoming increasingly popular in the Bioinformatics field because it allows for virtual-unlimited horizontal scalability while being easy-to-use. Here we provide a qualitative evaluation of some of the most significant MapReduce bioinformatics applications. We also focus on one of these applications to show the importance of correctly engineering an application to fully exploit the potential of Distributed Systems.
2017
Settore INF/01 - Informatica
Cattaneo, G., Giancarlo, R., Petrillo, U.F., Roscigno, G. (2017). MapReduce in Computational Biology Via Hadoop and Spark. In Reference Module in the Life Sciences. Elsevier [10.1016/B978-0-12-809633-8.20371-3].
File in questo prodotto:
File Dimensione Formato  
cattaneo2018.pdf

Solo gestori archvio

Dimensione 344.58 kB
Formato Adobe PDF
344.58 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/291372
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? ND
social impact