In this paper we present an hypothesis test of randomness based on the probability density function of the symmetrized Kulback-Leibler distance estimated, via a Monte Carlo simulation, by the distributions of the interval lengths detected using the Multi-Layer Model (MLM). The $MLM$ is based on the generation of several sub-samples of an input signal; in particular a set of optimal cut-set thresholds are applied to the data to detect signal properties. In this sense MLM is a general pattern detection method and it can be considered a preprocessing tool for pattern discovery. At the present the test has been evaluated on simulated signals which respect a particular tiled microarray approach used to reveal nucleosome positioning on Saccharomyces cerevisiae; this in order to control the accuracy of the proposed test of randomness. It has been also applied to real biological data. Results indicate that such statistical test may indicate the presence of structures in the signal with low signal to noise ratio.
Di Gesù, V., Lo Bosco, G., Pinello, L. (2009). Interval Length Analysis in Multi Layer Model. In F. Masulli, R. Tagliaferri, G.M. Verkhivker (a cura di), Computational Intelligence Methods for Bioinformatics and Biostatistics, 5th International Meeting, CIBB 2008 Vietri sul Mare, Italy, October 3-4, 2008 Revised Selected Papers (pp. 114-122) [10.1007/978-3-642-02504-4_10].
Interval Length Analysis in Multi Layer Model
DI GESU', Vito;LO BOSCO, Giosue';PINELLO, Luca
2009-01-01
Abstract
In this paper we present an hypothesis test of randomness based on the probability density function of the symmetrized Kulback-Leibler distance estimated, via a Monte Carlo simulation, by the distributions of the interval lengths detected using the Multi-Layer Model (MLM). The $MLM$ is based on the generation of several sub-samples of an input signal; in particular a set of optimal cut-set thresholds are applied to the data to detect signal properties. In this sense MLM is a general pattern detection method and it can be considered a preprocessing tool for pattern discovery. At the present the test has been evaluated on simulated signals which respect a particular tiled microarray approach used to reveal nucleosome positioning on Saccharomyces cerevisiae; this in order to control the accuracy of the proposed test of randomness. It has been also applied to real biological data. Results indicate that such statistical test may indicate the presence of structures in the signal with low signal to noise ratio.File | Dimensione | Formato | |
---|---|---|---|
Di Gesù, Lo Bosco, Pinello - 2008 - Interval Length Analysis in Multi Layer Model.pdf
Solo gestori archvio
Tipologia:
Versione Editoriale
Dimensione
431.37 kB
Formato
Adobe PDF
|
431.37 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.