Dataset Information

Future directions for high-throughput splicing assays in precision medicine.


ABSTRACT: Classification of variants of unknown significance is a challenging technical problem in clinical genetics. As up to one-third of disease-causing mutations are thought to affect pre-mRNA splicing, it is important to accurately classify splicing mutations in patient sequencing data. Several consortia and healthcare systems have conducted large-scale patient sequencing studies, which discover novel variants faster than they can be classified. Here, we compare the advantages and limitations of several high-throughput splicing assays aimed at mitigating this bottleneck, and describe a data set of ~5,000 variants that we analyzed using our Massively Parallel Splicing Assay (MaPSy). The Critical Assessment of Genome Interpretation group (CAGI) organized a challenge, in which participants submitted machine learning models to predict the splicing effects of variants in this data set. We discuss the winning submission of the challenge (MMSplice), which outperformed existing software. Finally, we highlight methods to overcome the limitations of MaPSy and similar assays, such as tissue-specific splicing, the effect of surrounding sequence context, classifying intronic variants, synthesizing large exons, and amplifying complex libraries of minigene species. Further development of these assays will greatly benefit the field of clinical genetics, which lacks high-throughput methods for variant interpretation.

SUBMITTER: Rhine CL 

PROVIDER: S-EPMC6744296 | biostudies-literature | 2019 Sep

REPOSITORIES: biostudies-literature

Publications

Future directions for high-throughput splicing assays in precision medicine.

Christy L Rhine, Christopher Neil, David T Glidden, Kamil J Cygan, Alger M Fredericks, Jing Wang, Nephi A Walton, William G Fairbrother

Human Mutation, 2019-08-17, issue 9


Similar Datasets

| S-EPMC10222089 | biostudies-literature
| S-EPMC8345034 | biostudies-literature
| S-EPMC5506495 | biostudies-literature
| S-EPMC8634105 | biostudies-literature
| S-EPMC7322755 | biostudies-literature
| S-EPMC4664309 | biostudies-other
| S-EPMC7092395 | biostudies-literature
| S-EPMC4538440 | biostudies-literature
| S-EPMC8044947 | biostudies-literature
| S-EPMC5560434 | biostudies-literature