Genomic sequences are usually compared using evolutionary distance, a procedure that implies the alignment of the sequences. Alignment of long sequences is a time consuming procedure and the obtained dissimilarity results is not a metric. Recently, the normalised compression distance was introduced as a method to calculate the distance between two generic digital objects and it seems a suitable way to compare genomic strings. In this paper, the clustering and the non-linear mapping obtained using the evolutionary distance and the compression distance are compared, in order to understand if the two distances sets are similar.
Rosa, M., Gaglio, S., Rizzo, R., Urso, A. (2009). Normalised compression distance and evolutionary distance of genomic sequences: comparison of clustering results. INTERNATIONAL JOURNAL OF KNOWLEDGE ENGINEERING AND SOFT DATA PARADIGMS, 1(4), 345-362 [10.1504/IJKESDP.2009.028987].
Normalised compression distance and evolutionary distance of genomic sequences: comparison of clustering results
GAGLIO, Salvatore;
2009-01-01
Abstract
Genomic sequences are usually compared using evolutionary distance, a procedure that implies the alignment of the sequences. Alignment of long sequences is a time consuming procedure and the obtained dissimilarity results is not a metric. Recently, the normalised compression distance was introduced as a method to calculate the distance between two generic digital objects and it seems a suitable way to compare genomic strings. In this paper, the clustering and the non-linear mapping obtained using the evolutionary distance and the compression distance are compared, in order to understand if the two distances sets are similar.File | Dimensione | Formato | |
---|---|---|---|
Normalised compression distance and evolutionary distance of genomic sequences - comparison of clustering results.pdf
Solo gestori archvio
Descrizione: Articolo Principale
Dimensione
302.67 kB
Formato
Adobe PDF
|
302.67 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.