Unknown

Dataset Information

0

Direct genome-wide identification of G-quadruplex structures by whole-genome resequencing.


ABSTRACT: We present a user-friendly and transferable genome-wide DNA G-quadruplex (G4) profiling method that identifies G4 structures from ordinary whole-genome resequencing data by seizing the slight fluctuation of sequencing quality. In the human genome, 736,689 G4 structures were identified, of which 45.9% of all predicted canonical G4-forming sequences were characterized. Over 89% of the detected canonical G4s were also identified by combining polymerase stop assays with next-generation sequencing. Testing using public datasets of 6 species demonstrated that the present method is widely applicable. The detection rates of predicted canonical quadruplexes ranged from 32% to 58%. Because single nucleotide variations (SNVs) influence the formation of G4 structures and have individual differences, the given method is available to identify and characterize G4s genome-wide for specific individuals.

SUBMITTER: Tu J 

PROVIDER: S-EPMC8516911 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

2020-10-10 | GSE159307 | GEO
| S-EPMC8860588 | biostudies-literature
| S-EPMC4279814 | biostudies-literature
| S-EPMC5513431 | biostudies-other
| S-EPMC9490000 | biostudies-literature
| S-EPMC7333386 | biostudies-literature
2021-04-27 | GSE173103 | GEO
| S-EPMC9902616 | biostudies-literature
| S-EPMC7483440 | biostudies-literature
| S-EPMC4021222 | biostudies-literature