Unknown

Dataset Information

0

DupMasker: a tool for annotating primate segmental duplications.


ABSTRACT: Segmental duplications (SDs) play an important role in genome rearrangement, evolution, and the copy-number variation (CNV) of primate genomes. Such sequences are difficult to detect, a priori, because they share no defining sequence features that distinguish them from unique portions of the genome. Current sequence annotation of segmental duplications requires computationally intensive, genome-wide self-comparisons that cannot be easily implemented on new data sets. Based on the successful implementation of RepeatMasker, we developed a new genome annotation tool, DupMasker. The program uses a library of nonredundant consensus sequences of human segmental duplications, wherein a majority of the ancestral origins have been determined based on comparisons to mammalian outgroup genomes. Using DupMasker, new human and nonhuman primate (NHP) sequences may be readily queried to provide details on the origin and degree of sequence identity of each duplicon. This program can be applied to delineate the order and orientation of duplicons within complex duplication blocks and used to characterize structural variation differences between sequenced human haplotypes. We predict this tool will be valuable in the annotation of large-insert sequence clones, allowing putative unique and duplicated regions of the genomes to be annotated prior to whole genome assembly comparisons.

SUBMITTER: Jiang Z 

PROVIDER: S-EPMC2493431 | biostudies-literature | 2008 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

DupMasker: a tool for annotating primate segmental duplications.

Jiang Zhaoshi Z   Hubley Robert R   Smit Arian A   Eichler Evan E EE  

Genome research 20080523 8


Segmental duplications (SDs) play an important role in genome rearrangement, evolution, and the copy-number variation (CNV) of primate genomes. Such sequences are difficult to detect, a priori, because they share no defining sequence features that distinguish them from unique portions of the genome. Current sequence annotation of segmental duplications requires computationally intensive, genome-wide self-comparisons that cannot be easily implemented on new data sets. Based on the successful impl  ...[more]

Similar Datasets

| S-EPMC3906575 | biostudies-literature
| S-EPMC525679 | biostudies-literature
| S-EPMC3735471 | biostudies-literature
| S-EPMC8254307 | biostudies-literature
2008-10-31 | GSE13266 | GEO
| S-EPMC2714723 | biostudies-literature
| S-EPMC2935423 | biostudies-literature
| S-EPMC8049597 | biostudies-literature
| S-EPMC6382464 | biostudies-literature
2010-05-17 | E-GEOD-13266 | biostudies-arrayexpress