Unknown

Dataset Information

0

Whole genome and exome sequencing reference datasets from a multi-center and cross-platform benchmark study.


ABSTRACT: With the rapid advancement of sequencing technologies, next generation sequencing (NGS) analysis has been widely applied in cancer genomics research. More recently, NGS has been adopted in clinical oncology to advance personalized medicine. Clinical applications of precision oncology require accurate tests that can distinguish tumor-specific mutations from artifacts introduced during NGS processes or data analysis. Therefore, there is an urgent need to develop best practices in cancer mutation detection using NGS and the need for standard reference data sets for systematically measuring accuracy and reproducibility across platforms and methods. Within the SEQC2 consortium context, we established paired tumor-normal reference samples and generated whole-genome (WGS) and whole-exome sequencing (WES) data using sixteen library protocols, seven sequencing platforms at six different centers. We systematically interrogated somatic mutations in the reference samples to identify factors affecting detection reproducibility and accuracy in cancer genomes. These large cross-platform/site WGS and WES datasets using well-characterized reference samples will represent a powerful resource for benchmarking NGS technologies, bioinformatics pipelines, and for the cancer genomics studies.

SUBMITTER: Zhao Y 

PROVIDER: S-EPMC8578599 | biostudies-literature | 2021 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Whole genome and exome sequencing reference datasets from a multi-center and cross-platform benchmark study.

Zhao Yongmei Y   Fang Li Tai LT   Shen Tsai-Wei TW   Choudhari Sulbha S   Talsania Keyur K   Chen Xiongfong X   Shetty Jyoti J   Kriga Yuliya Y   Tran Bao B   Zhu Bin B   Chen Zhong Z   Chen Wanqiu W   Wang Charles C   Jaeger Erich E   Meerzaman Daoud D   Lu Charles C   Idler Kenneth K   Ren Luyao L   Zheng Yuanting Y   Shi Leming L   Petitjean Virginie V   Sultan Marc M   Hung Tiffany T   Peters Eric E   Drabek Jiri J   Vojta Petr P   Maestro Roberta R   Gasparotto Daniela D   Kõks Sulev S   Reimann Ene E   Scherer Andreas A   Nordlund Jessica J   Liljedahl Ulrika U   Foox Jonathan J   Mason Christopher E CE   Xiao Chunlin C   Hong Huixiao H   Xiao Wenming W  

Scientific data 20211109 1


With the rapid advancement of sequencing technologies, next generation sequencing (NGS) analysis has been widely applied in cancer genomics research. More recently, NGS has been adopted in clinical oncology to advance personalized medicine. Clinical applications of precision oncology require accurate tests that can distinguish tumor-specific mutations from artifacts introduced during NGS processes or data analysis. Therefore, there is an urgent need to develop best practices in cancer mutation d  ...[more]

Similar Datasets

| S-EPMC7854649 | biostudies-literature
| S-EPMC8135134 | biostudies-literature
| S-EPMC5888834 | biostudies-other
| S-EPMC3362899 | biostudies-other
| S-EPMC6160438 | biostudies-literature
| S-EPMC4526866 | biostudies-literature
| S-EPMC4253833 | biostudies-other
| S-EPMC7953489 | biostudies-literature
| S-EPMC7736078 | biostudies-literature
| S-EPMC10908857 | biostudies-literature