Background: Genome‐wide mapping of protein‐DNA interactions has been widely used to investigate biological functions of the genome. An important question is to what extent such interactions are regulated at the DNA sequence level. However, current investigation is hampered by the lack of computational methods for systematic evaluating sequence specificity. Results: We present a simple, unbiased quantitative measure for DNA sequence specificity called the Motif Independent Measure (MIM). By analyzing both simulated and real experimental data, we found that the MIM measure can be used to detect sequence specificity independent of presence of transcription factor (TF) binding motifs. We also found that the level of specificity associated with H3K4me1 target sequences is highly cell‐type specific and highest in embryonic stem (ES) cells. We predicted H3K4me1 target sequences by using the Nscore model and found that the prediction accuracy is indeed high in ES cells. Conclusions: Our method provides a unified framework for quantifying DNA sequence specificity and serves as a guide for development of sequence‐based prediction models.

Pinello, L., Lo Bosco, G., Hanlon, B., Yuan, G. (2011). A motif-independent metric for DNA sequence specificity. BMC BIOINFORMATICS, 12 [10.1186/1471-2105-12-408].

A motif-independent metric for DNA sequence specificity

LO BOSCO, Giosue';
2011-01-01

Abstract

Background: Genome‐wide mapping of protein‐DNA interactions has been widely used to investigate biological functions of the genome. An important question is to what extent such interactions are regulated at the DNA sequence level. However, current investigation is hampered by the lack of computational methods for systematic evaluating sequence specificity. Results: We present a simple, unbiased quantitative measure for DNA sequence specificity called the Motif Independent Measure (MIM). By analyzing both simulated and real experimental data, we found that the MIM measure can be used to detect sequence specificity independent of presence of transcription factor (TF) binding motifs. We also found that the level of specificity associated with H3K4me1 target sequences is highly cell‐type specific and highest in embryonic stem (ES) cells. We predicted H3K4me1 target sequences by using the Nscore model and found that the prediction accuracy is indeed high in ES cells. Conclusions: Our method provides a unified framework for quantifying DNA sequence specificity and serves as a guide for development of sequence‐based prediction models.
Settore INF/01 - Informatica
Pinello, L., Lo Bosco, G., Hanlon, B., Yuan, G. (2011). A motif-independent metric for DNA sequence specificity. BMC BIOINFORMATICS, 12 [10.1186/1471-2105-12-408].
File in questo prodotto:
File Dimensione Formato  
Pinello et al. - 2011 - A motif-independent metric for DNA sequence specificity.pdf

accesso aperto

Dimensione 3.15 MB
Formato Adobe PDF
3.15 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/60523
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 17
  • ???jsp.display-item.citation.isi??? 16
social impact