In this paper we present an hypothesis test of randomness based on the probability density function of the symmetrized Kulback-Leibler distance estimated, via a Monte Carlo simulation, by the distributions of the interval lengths detected using the Multi-Layer Model (MLM). The $MLM$ is based on the generation of several sub-samples of an input signal; in particular a set of optimal cut-set thresholds are applied to the data to detect signal properties. In this sense MLM is a general pattern detection method and it can be considered a preprocessing tool for pattern discovery. At the present the test has been evaluated on simulated signals which respect a particular tiled microarray approach used to reveal nucleosome positioning on Saccharomyces cerevisiae; this in order to control the accuracy of the proposed test of randomness. It has been also applied to real biological data. Results indicate that such statistical test may indicate the presence of structures in the signal with low signal to noise ratio.

Di Gesù, V., Lo Bosco, G., Pinello, L. (2009). Interval Length Analysis in Multi Layer Model. In F. Masulli, R. Tagliaferri, G.M. Verkhivker (a cura di), Computational Intelligence Methods for Bioinformatics and Biostatistics, 5th International Meeting, CIBB 2008 Vietri sul Mare, Italy, October 3-4, 2008 Revised Selected Papers (pp. 114-122) [10.1007/978-3-642-02504-4_10].

Interval Length Analysis in Multi Layer Model

DI GESU', Vito;LO BOSCO, Giosue';PINELLO, Luca
2009-01-01

Abstract

In this paper we present an hypothesis test of randomness based on the probability density function of the symmetrized Kulback-Leibler distance estimated, via a Monte Carlo simulation, by the distributions of the interval lengths detected using the Multi-Layer Model (MLM). The $MLM$ is based on the generation of several sub-samples of an input signal; in particular a set of optimal cut-set thresholds are applied to the data to detect signal properties. In this sense MLM is a general pattern detection method and it can be considered a preprocessing tool for pattern discovery. At the present the test has been evaluated on simulated signals which respect a particular tiled microarray approach used to reveal nucleosome positioning on Saccharomyces cerevisiae; this in order to control the accuracy of the proposed test of randomness. It has been also applied to real biological data. Results indicate that such statistical test may indicate the presence of structures in the signal with low signal to noise ratio.
2009
Settore INF/01 - Informatica
978-3-642-02503-7
978-3-642-02504-4
Di Gesù, V., Lo Bosco, G., Pinello, L. (2009). Interval Length Analysis in Multi Layer Model. In F. Masulli, R. Tagliaferri, G.M. Verkhivker (a cura di), Computational Intelligence Methods for Bioinformatics and Biostatistics, 5th International Meeting, CIBB 2008 Vietri sul Mare, Italy, October 3-4, 2008 Revised Selected Papers (pp. 114-122) [10.1007/978-3-642-02504-4_10].
File in questo prodotto:
File Dimensione Formato  
Di Gesù, Lo Bosco, Pinello - 2008 - Interval Length Analysis in Multi Layer Model.pdf

Solo gestori archvio

Tipologia: Versione Editoriale
Dimensione 431.37 kB
Formato Adobe PDF
431.37 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/40101
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact