Unknown

Dataset Information

0

Tandem repeats and G-rich sequences are enriched at human CNV breakpoints.


ABSTRACT: Chromosome breakage in germline and somatic genomes gives rise to copy number variation (CNV) responsible for genomic disorders and tumorigenesis. DNA sequence is known to play an important role in breakage at chromosome fragile sites; however, the sequences susceptible to double-strand breaks (DSBs) underlying CNV formation are largely unknown. Here we analyze 140 germline CNV breakpoints from 116 individuals to identify DNA sequences enriched at breakpoint loci compared to 2800 simulated control regions. We find that, overall, CNV breakpoints are enriched in tandem repeats and sequences predicted to form G-quadruplexes. G-rich repeats are overrepresented at terminal deletion breakpoints, which may be important for the addition of a new telomere. Interstitial deletions and duplication breakpoints are enriched in Alu repeats that in some cases mediate non-allelic homologous recombination (NAHR) between the two sides of the rearrangement. CNV breakpoints are enriched in certain classes of repeats that may play a role in DNA secondary structure, DSB susceptibility and/or DNA replication errors.

SUBMITTER: Bose P 

PROVIDER: S-EPMC4090240 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications

Tandem repeats and G-rich sequences are enriched at human CNV breakpoints.

Bose Promita P   Hermetz Karen E KE   Conneely Karen N KN   Rudd M Katharine MK  

PloS one 20140701 7


Chromosome breakage in germline and somatic genomes gives rise to copy number variation (CNV) responsible for genomic disorders and tumorigenesis. DNA sequence is known to play an important role in breakage at chromosome fragile sites; however, the sequences susceptible to double-strand breaks (DSBs) underlying CNV formation are largely unknown. Here we analyze 140 germline CNV breakpoints from 116 individuals to identify DNA sequences enriched at breakpoint loci compared to 2800 simulated contr  ...[more]

Similar Datasets

| S-EPMC1135851 | biostudies-other
| S-EPMC6047738 | biostudies-literature
| S-EPMC148217 | biostudies-other
| S-EPMC8269118 | biostudies-literature
| S-EPMC7302867 | biostudies-literature
| S-EPMC2928500 | biostudies-literature
| S-EPMC1471791 | biostudies-literature
| S-EPMC3402919 | biostudies-literature
| S-EPMC4153896 | biostudies-literature
| S-EPMC2475761 | biostudies-literature