
Dataset Information


OTG-snpcaller: an optimized pipeline based on TMAP and GATK for SNP calling from Ion Torrent data.


ABSTRACT: Because the new Proton platform from Life Technologies produces data markedly different from those of the Illumina platform, the conventional Illumina data analysis pipeline cannot be used directly. We developed an optimized SNP calling method using TMAP and GATK (OTG-snpcaller). This method combines our own optimized processes, Remove Duplicates According to AS Tag (RDAST) and Alignment Optimize Structure (AOS), with TMAP and GATK to call SNPs from Proton data. We sequenced four sets of exomes, captured with the Agilent SureSelect and NimbleGen SeqCap EZ kits, on Life Technologies' Ion Proton sequencer. We then applied OTG-snpcaller and compared its results with those from Torrent Variant Caller. The results indicated that OTG-snpcaller can reduce both false positive and false negative rates. Moreover, we compared our results with Illumina results generated by GATK Best Practices and found that the results of the two platforms were comparable. The good performance in variant calling using GATK Best Practices can be primarily attributed to the high quality of the Illumina sequences.
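
The abstract names the RDAST step but does not describe how it works. The following is only a minimal sketch of one plausible reading, assuming that reads sharing the same chromosome, start position, and strand are treated as duplicates and that the read with the highest alignment score (the SAM `AS` tag) is kept. The `Read` type, the `remove_duplicates_by_as` function, and the grouping key are hypothetical and are not taken from the OTG-snpcaller source.

```python
from typing import Iterable, List, NamedTuple


class Read(NamedTuple):
    """Hypothetical, simplified stand-in for an aligned read."""
    name: str
    chrom: str
    pos: int               # leftmost aligned position
    reverse: bool          # True if aligned to the reverse strand
    alignment_score: int   # value of the SAM AS tag


def remove_duplicates_by_as(reads: Iterable[Read]) -> List[Read]:
    """Keep one read per (chrom, pos, strand) group: the one with the highest AS.

    Assumption: this grouping key and tie-breaking rule (first read wins on
    ties) are illustrative choices, not the published RDAST definition.
    """
    best = {}
    for read in reads:
        key = (read.chrom, read.pos, read.reverse)
        kept = best.get(key)
        if kept is None or read.alignment_score > kept.alignment_score:
            best[key] = read
    # Return the survivors in coordinate order.
    return sorted(best.values(), key=lambda r: (r.chrom, r.pos, r.reverse))


reads = [
    Read("r1", "chr1", 100, False, 50),
    Read("r2", "chr1", 100, False, 72),  # duplicate of r1 with a higher AS
    Read("r3", "chr1", 100, True, 40),   # same position, opposite strand
    Read("r4", "chr2", 5, False, 60),
]
kept = remove_duplicates_by_as(reads)
kept_names = [r.name for r in kept]
```

In this toy input, `r1` and `r2` fall into the same group, so only the higher-scoring `r2` survives; `r3` and `r4` are kept because each is alone in its group.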

SUBMITTER: Zhu P 

PROVIDER: S-EPMC4019570 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature


Publications

OTG-snpcaller: an optimized pipeline based on TMAP and GATK for SNP calling from Ion Torrent data.

Zhu Pengyuan, He Lingyu, Li Yaqiao, Huang Wenpan, Xi Feng, Lin Lin, Zhi Qihuan, Zhang Wenwei, Tang Y Tom, Geng Chunyu, Lu Zhiyuan, Xu Xun

PLoS ONE, 2014-05-13, issue 5



Similar Datasets

| S-EPMC8361789 | biostudies-literature
| S-EPMC3711422 | biostudies-literature
| S-EPMC3711900 | biostudies-literature
| S-EPMC5123378 | biostudies-literature
| S-EPMC5374681 | biostudies-literature
| S-EPMC4673977 | biostudies-literature
| S-EPMC3491382 | biostudies-literature
| PRJEB2440 | ENA
| PRJEB2441 | ENA
| PRJEB2442 | ENA