Unknown

Dataset Information

0

Reliable prediction of transcription factor binding sites by phylogenetic verification.


ABSTRACT: We present a statistical methodology that largely improves the accuracy in computational predictions of transcription factor (TF) binding sites in eukaryote genomes. This method models the cross-species conservation of binding sites without relying on accurate sequence alignment. It can be coupled with any motif-finding algorithm that searches for overrepresented sequence motifs in individual species and can increase the accuracy of the coupled motif-finding algorithm. Because this method is capable of accurately detecting TF binding sites, it also enhances our ability to predict the cis-regulatory modules. We applied this method on the published chromatin immunoprecipitation (ChIP)-chip data in Saccharomyces cerevisiae and found that its sensitivity and specificity are 9% and 14% higher than those of two recent methods. We also recovered almost all of the previously verified TF binding sites and made predictions on the cis-regulatory elements that govern the tight regulation of ribosomal protein genes in 13 eukaryote species (2 plants, 4 yeasts, 2 worms, 2 insects, and 3 mammals). These results give insights to the transcriptional regulation in eukaryotic organisms.

SUBMITTER: Li X 

PROVIDER: S-EPMC1283155 | biostudies-literature | 2005 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Reliable prediction of transcription factor binding sites by phylogenetic verification.

Li Xiaoman X   Zhong Sheng S   Wong Wing H WH  

Proceedings of the National Academy of Sciences of the United States of America 20051114 47


We present a statistical methodology that largely improves the accuracy in computational predictions of transcription factor (TF) binding sites in eukaryote genomes. This method models the cross-species conservation of binding sites without relying on accurate sequence alignment. It can be coupled with any motif-finding algorithm that searches for overrepresented sequence motifs in individual species and can increase the accuracy of the coupled motif-finding algorithm. Because this method is cap  ...[more]

Similar Datasets

| S-EPMC30389 | biostudies-literature
| S-EPMC3931712 | biostudies-literature
| S-EPMC6360811 | biostudies-literature
| S-EPMC2847719 | biostudies-literature
| S-EPMC9204005 | biostudies-literature
| S-EPMC441601 | biostudies-literature
| S-EPMC2931695 | biostudies-other
| S-EPMC1866359 | biostudies-literature
| S-EPMC3526440 | biostudies-literature
| S-EPMC2922897 | biostudies-literature