Dataset Information

Efficient Mining of Variants From Trios for Ventricular Septal Defect Association Study.


ABSTRACT: Ventricular septal defect (VSD) is a fatal congenital heart disease with severe consequences for affected infants. Early diagnosis plays an important role, particularly diagnosis through genetic variants. Existing panel-based approaches to variant mining suffer from a shortage of large panels, costly sequencing, and missed rare variants. Although a trio-based method alleviates these limitations to some extent, it is agnostic to novel mutations and computationally intensive. Considering these limitations, we study a novel algorithm for mining variants from trio-based sequencing data and apply it to a VSD trio to identify associated mutations. Our approach starts by filtering irrelevant k-mers from the sequences of a trio via a newly conceived coupled Bloom filter, then corrects sequencing errors with a statistical approach and extends the retained k-mers into long sequences. These extended sequences are used as input for identifying variants. The obtained variants are then comprehensively analyzed against existing databases to mine VSD-related mutations. Experiments show that our trio-based algorithm narrows down candidate coding genes and lncRNAs by about 10-fold and 5-fold, respectively, compared with single-sequence-based approaches. Meanwhile, our algorithm is 10 times faster and uses two orders of magnitude less memory than the existing state-of-the-art approach. By applying our approach to a VSD trio, we identify a previously unreported gene, CD80; a combination of two genes, MYBPC3 and TRDN; and a lncRNA, NONHSAT096266.2, which are highly likely to be VSD-related.
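
The k-mer filtering step in the abstract can be illustrated with a minimal sketch. This record does not describe the "coupled Bloom filter", so the code below substitutes a single ordinary Bloom filter and assumes that "irrelevant" k-mers are those the child shares with either parent. The names `BloomFilter`, `kmers`, and `child_specific_kmers`, the filter size, and the hash count are all hypothetical and are not taken from the authors' implementation. Error correction and k-mer extension are not shown.

```python
import hashlib


class BloomFilter:
    """Plain Bloom filter (an assumption; NOT the paper's coupled variant)."""

    def __init__(self, num_bits=1 << 16, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, item):
        # Derive num_hashes bit positions by salting SHA-256 with a seed.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{item}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, item):
        # May return a false positive, never a false negative.
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(item)
        )


def kmers(seq, k):
    """Return all overlapping substrings of length k, in order."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def child_specific_kmers(child_reads, father_reads, mother_reads, k):
    """Keep child k-mers that the parental Bloom filter has not seen.

    Assumption: k-mers shared with a parent are 'irrelevant'. Because a
    Bloom filter can report false positives, a few truly child-specific
    k-mers may be dropped, but no parental k-mer is ever kept.
    """
    parents = BloomFilter()
    for read in list(father_reads) + list(mother_reads):
        for km in kmers(read, k):
            parents.add(km)

    kept, seen = [], set()
    for read in child_reads:
        for km in kmers(read, k):
            if km not in seen and km not in parents:
                seen.add(km)
                kept.append(km)
    return kept


if __name__ == "__main__":
    # Toy reads: the child differs from both parents at one base.
    father = ["ACGTACGTAC"]
    mother = ["ACGTACGTAC"]
    child = ["ACGTAGGTAC"]
    print(child_specific_kmers(child, father, mother, k=4))
```

The design trades exactness for memory: the parental k-mer set is never stored explicitly, only a fixed-size bit array, which is the general reason a Bloom filter suits this kind of filtering.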

SUBMITTER: Jiang P 

PROVIDER: S-EPMC6694746 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature

Publications

Efficient Mining of Variants From Trios for Ventricular Septal Defect Association Study.

Jiang Peng, Hu Yaofei, Wang Yiqi, Zhang Jin, Zhu Qinghong, Bai Lin, Tong Qiang, Li Tao, Zhao Liang

Frontiers in Genetics, 2019-08-08


Similar Datasets

| S-EPMC1413599 | biostudies-literature
| S-EPMC4296393 | biostudies-literature
| S-EPMC4987479 | biostudies-literature
| S-EPMC8299768 | biostudies-literature
| S-EPMC6360179 | biostudies-literature
| S-EPMC9238637 | biostudies-literature
2014-02-10 | E-GEOD-54675 | biostudies-arrayexpress
| S-EPMC10107004 | biostudies-literature
| S-EPMC6200681 | biostudies-other
| S-EPMC4322411 | biostudies-literature