Unknown

Dataset Information

0

Annotation of cis-regulatory elements by identification, subclassification, and functional assessment of multispecies conserved sequences.


ABSTRACT: An important step toward improving the annotation of the human genome is to identify cis-acting regulatory elements from primary DNA sequence. One approach is to compare sequences from multiple, divergent species. This approach distinguishes multispecies conserved sequences (MCS) in noncoding regions from more rapidly evolving neutral DNA. Here, we have analyzed a region of approximately 238kb containing the human alpha globin cluster that was sequenced and/or annotated across the syntenic region in 22 species spanning 500 million years of evolution. Using a variety of bioinformatic approaches and correlating the results with many aspects of chromosome structure and function in this region, we were able to identify and evaluate the importance of 24 individual MCSs. This approach sensitively and accurately identified previously characterized regulatory elements but also discovered unidentified promoters, exons, splicing, and transcriptional regulatory elements. Together, these studies demonstrate an integrated approach by which to identify, subclassify, and predict the potential importance of MCSs.

SUBMITTER: Hughes JR 

PROVIDER: S-EPMC1174996 | biostudies-literature | 2005 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Annotation of cis-regulatory elements by identification, subclassification, and functional assessment of multispecies conserved sequences.

Hughes Jim R JR   Cheng Jan-Fang JF   Ventress Nicki N   Prabhakar Shyam S   Clark Kevin K   Anguita Eduardo E   De Gobbi Marco M   de Jong Pieter P   Rubin Eddy E   Higgs Douglas R DR  

Proceedings of the National Academy of Sciences of the United States of America 20050705 28


An important step toward improving the annotation of the human genome is to identify cis-acting regulatory elements from primary DNA sequence. One approach is to compare sequences from multiple, divergent species. This approach distinguishes multispecies conserved sequences (MCS) in noncoding regions from more rapidly evolving neutral DNA. Here, we have analyzed a region of approximately 238kb containing the human alpha globin cluster that was sequenced and/or annotated across the syntenic regio  ...[more]

Similar Datasets

| S-EPMC4653392 | biostudies-literature
| S-EPMC9118141 | biostudies-literature
| S-EPMC7200997 | biostudies-literature
| S-EPMC4668379 | biostudies-literature
| S-EPMC9597995 | biostudies-literature
| S-EPMC4179605 | biostudies-literature
| S-EPMC6161761 | biostudies-literature
| S-EPMC2235840 | biostudies-literature
| S-EPMC5994936 | biostudies-literature