Dataset Information

Reliable identification of genomic variants from RNA-seq data.


ABSTRACT: Identifying genomic variation is a crucial step for unraveling the relationship between genotype and phenotype and can yield important insights into human diseases. Prevailing methods rely on cost-intensive whole-genome sequencing (WGS) or whole-exome sequencing (WES) approaches while the identification of genomic variants from often existing RNA sequencing (RNA-seq) data remains a challenge because of the intrinsic complexity in the transcriptome. Here, we present a highly accurate approach termed SNPiR to identify SNPs in RNA-seq data. We applied SNPiR to RNA-seq data of samples for which WGS and WES data are also available and achieved high specificity and sensitivity. Of the SNPs called from the RNA-seq data, >98% were also identified by WGS or WES. Over 70% of all expressed coding variants were identified from RNA-seq, and comparable numbers of exonic variants were identified in RNA-seq and WES. Despite our method's limitation in detecting variants in expressed regions only, our results demonstrate that SNPiR outperforms current state-of-the-art approaches for variant detection from RNA-seq data and offers a cost-effective and reliable alternative for SNP discovery.

SUBMITTER: Piskol R 

PROVIDER: S-EPMC3791257 | biostudies-literature | 2013 Oct

REPOSITORIES: biostudies-literature

Publications

Reliable identification of genomic variants from RNA-seq data.

Robert Piskol, Gokul Ramaswami, Jin Billy Li

American Journal of Human Genetics, 2013 Sep 26, issue 4


Similar Datasets

| S-EPMC9279659 | biostudies-literature
| S-EPMC2863065 | biostudies-literature
| S-EPMC4132698 | biostudies-literature
| S-EPMC6360649 | biostudies-literature
| S-EPMC5568602 | biostudies-literature
| S-EPMC5144000 | biostudies-literature
| S-EPMC4147886 | biostudies-literature
| S-EPMC7088422 | biostudies-literature
| S-EPMC5860558 | biostudies-other
| S-EPMC8797241 | biostudies-literature