Dataset Information

Long-Read Sequencing Improves the Detection of Structural Variations Impacting Complex Non-Coding Elements of the Genome.


ABSTRACT: The advent of long-read sequencing offers a new method for detecting genomic structural variation (SV) in numerous rare genetic diseases. For autism spectrum disorder (ASD) cases where pathogenic variants are not found in the protein-coding genic regions along chromosomes, we proposed a scalable workflow to characterize the risk factor of SVs impacting non-coding elements of the genome. We applied whole-genome sequencing to an Emirati family having three children with ASD, using both long- and short-read sequencing technology. A series of analytical pipelines was established to identify a set of SVs with high sensitivity and specificity. At 15-fold coverage, long-read sequencing detected a significantly higher number of SVs (987 variants) than short-read sequencing (509 variants) (p-value < 1.1020 × 10^-57). Further comparison showed that 97.9% of the long-read sequencing variants fell within the 1-100 kb size range (p-value < 9.080 × 10^-67) and impacted over 5000 genes. Moreover, the long-read variants impacted 604 non-coding RNAs (p-value < 9.02 × 10^-9), comprising 58% microRNA, 31.9% lncRNA, and 9.1% snoRNA. Even at low coverage, long-read sequencing has been shown to be a reliable technology for detecting SVs impacting complex elements of the genome.

SUBMITTER: Begum G 

PROVIDER: S-EPMC7923155 | biostudies-literature | 2021 Feb

REPOSITORIES: biostudies-literature

Publications

Long-Read Sequencing Improves the Detection of Structural Variations Impacting Complex Non-Coding Elements of the Genome.

Ghausia Begum, Ammar Albanna, Asma Bankapur, Nasna Nassir, Richa Tambi, Bakhrom K Berdiev, Hosneara Akter, Noushad Karuvantevida, Barbara Kellam, Deena Alhashmi, Wilson W L Sung, Bhooma Thiruvahindrapuram, Alawi Alsheikh-Ali, Stephen W Scherer, Mohammed Uddin

International Journal of Molecular Sciences, 2021-02-19, issue 4



Similar Datasets

S-EPMC10300613 | biostudies-literature
S-EPMC5990442 | biostudies-literature
S-EPMC10013895 | biostudies-literature
S-EPMC10415145 | biostudies-literature
S-EPMC8417358 | biostudies-literature
S-EPMC6946643 | biostudies-literature
S-EPMC8138798 | biostudies-literature
S-EPMC11234778 | biostudies-literature
S-EPMC11323893 | biostudies-literature
S-EPMC8206509 | biostudies-literature