Unknown

Dataset Information

0

Mapping and initial analysis of human subtelomeric sequence assemblies.


ABSTRACT: Physical mapping data were combined with public draft and finished sequences to derive subtelomeric sequence assemblies for each of the 41 genetically distinct human telomere regions. Sequence gaps that remain on the reference telomeres are generally small,well-defined,and for the most part,restricted to regions directly adjacent to the terminal (TTAGGG)n tract. Of the 20.66 Mb of subtelomeric DNA analyzed, 3.01 Mb are subtelomeric repeat sequences (Srpt),and an additional 2.11 Mb are segmental duplications. The subtelomeric sequence assemblies are enriched >25-fold in short,internal (TTAGGG)n-like sequences relative to the rest of the genome; a total of 114 (TTAGGG)n-like islands were found,55 within Srpt regions,35 within one-copy regions,11 at one-copy/Srpt or Srpt/segmental duplication boundaries,and 13 at the telomeric ends of assemblies. Transcripts were annotated in each assembly,noting their mapping coordinates relative to their respective telomere and whether they originate in duplicated DNA or single-copy DNA. A total of 697 transcripts were found in 15.53 Mb of one-copy DNA,76 transcripts in 2.11 Mb of segmentally duplicated DNA,and 168 transcripts in 3.01 Mb of Srpt sequence. This overall transcript density is similar (within approximately 10%) to that found genome-wide. Zinc finger-containing genes and olfactory receptor genes are duplicated within and between multiple telomere regions.

SUBMITTER: Riethman H 

PROVIDER: S-EPMC314271 | biostudies-literature | 2004 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Mapping and initial analysis of human subtelomeric sequence assemblies.

Riethman Harold H   Ambrosini Anthony A   Castaneda Carlos C   Finklestein Jeffrey J   Hu Xue-Lan XL   Mudunuri Uma U   Paul Sheila S   Wei Jun J  

Genome research 20040101 1


Physical mapping data were combined with public draft and finished sequences to derive subtelomeric sequence assemblies for each of the 41 genetically distinct human telomere regions. Sequence gaps that remain on the reference telomeres are generally small,well-defined,and for the most part,restricted to regions directly adjacent to the terminal (TTAGGG)n tract. Of the 20.66 Mb of subtelomeric DNA analyzed, 3.01 Mb are subtelomeric repeat sequences (Srpt),and an additional 2.11 Mb are segmental  ...[more]

Similar Datasets

| S-EPMC1270012 | biostudies-literature
| S-EPMC2756281 | biostudies-literature
| S-EPMC1160117 | biostudies-literature
| S-EPMC3113786 | biostudies-literature
| S-EPMC2731494 | biostudies-literature
2014-02-18 | E-GEOD-55053 | biostudies-arrayexpress
| S-EPMC2323237 | biostudies-literature
| S-EPMC4032850 | biostudies-literature
| S-EPMC4106270 | biostudies-literature
2014-02-18 | GSE55053 | GEO