Ontology highlight
ABSTRACT:
SUBMITTER: Alkan C
PROVIDER: S-EPMC1994983 | biostudies-literature | 2007 Sep
REPOSITORIES: biostudies-literature
Alkan Can C Ventura Mario M Archidiacono Nicoletta N Rocchi Mariano M Sahinalp S Cenk SC Eichler Evan E EE
PLoS computational biology 20070901 9
The major DNA constituent of primate centromeres is alpha satellite DNA. As much as 2%-5% of sequence generated as part of primate genome sequencing projects consists of this material, which is fragmented or not assembled as part of published genome sequences due to its highly repetitive nature. Here, we develop computational methods to rapidly recover and categorize alpha-satellite sequences from previously uncharacterized whole-genome shotgun sequence data. We present an algorithm to computati ...[more]