
Dataset Information


Detection of repeat expansions in large next generation DNA and RNA sequencing data without alignment.


ABSTRACT: Bioinformatic methods for detecting short tandem repeat expansions in short-read sequencing have identified new repeat expansions in humans, but require alignment information to identify repetitive motif enrichment at genomic locations. We present superSTR, an ultrafast method that does not require alignment. superSTR is used to process whole-genome and whole-exome sequencing data, and perform the first STR analysis of the UK Biobank, efficiently screening and identifying known and potential disease-associated STRs in the exomes of 49,953 biobank participants. We demonstrate the first bioinformatic screening of RNA sequencing data to detect repeat expansions in humans and mouse models of ataxia and dystrophy.

SUBMITTER: Fearnley LG 

PROVIDER: S-EPMC9338934 | biostudies-literature | 2022 Jul

REPOSITORIES: biostudies-literature


Publications

Detection of repeat expansions in large next generation DNA and RNA sequencing data without alignment.

Fearnley LG, Bennett MF, Bahlo M

Scientific Reports, 2022-07-30, issue 1



Similar Datasets

S-EPMC6008857 | biostudies-literature
S-EPMC4265526 | biostudies-literature
S-EPMC3581251 | biostudies-literature
S-EPMC2943993 | biostudies-literature
PXD003804 | Pride | 2017-04-03
S-EPMC3437896 | biostudies-other
S-EPMC4105452 | biostudies-other
S-EPMC4984844 | biostudies-literature
S-EPMC6050050 | biostudies-literature
S-EPMC3708773 | biostudies-literature