
Dataset Information


Data-driven unbiased curation of the TP53 tumor suppressor gene mutation database and validation by ultradeep sequencing of human tumors.


ABSTRACT: Cancer mutation databases are expected to play central roles in personalized medicine by providing targets for drug development and biomarkers to tailor treatments to each patient. The accuracy of reported mutations is a critical issue that is commonly overlooked, which leads to mutation databases that include a sizable number of spurious mutations, either sequencing errors or passenger mutations. Here we report an analysis of the latest version of the TP53 mutation database, including 34,453 mutations. By using several data-driven methods on multiple independent quality criteria, we obtained a quality score for each report contributing to the database. This score can now be used to filter for high-confidence mutations and reports within the database. Sequencing the entire TP53 gene from various types of cancer using next-generation sequencing with ultradeep coverage validated our approach for curation. In summary, 9.7% of all collected studies, mostly comprising numerous tumors with multiple infrequent TP53 mutations, should be excluded when analyzing TP53 mutations. Thus, by combining statistical and experimental analyses, we provide a curated mutation database for TP53 mutations and a framework for mutation database analysis.

SUBMITTER: Edlund K 

PROVIDER: S-EPMC3386058 | biostudies-literature | 2012 Jun

REPOSITORIES: biostudies-literature


Publications

Data-driven unbiased curation of the TP53 tumor suppressor gene mutation database and validation by ultradeep sequencing of human tumors.

Karolina Edlund, Ola Larsson, Adam Ameur, Ignas Bunikis, Ulf Gyllensten, Bernard Leroy, Magnus Sundström, Patrick Micke, Johan Botling, Thierry Soussi

Proceedings of the National Academy of Sciences of the United States of America, 2012-05-24, issue 24



Similar Datasets

S-EPMC4374372 | biostudies-literature
S-EPMC4457984 | biostudies-literature
S-EPMC4303426 | biostudies-literature
S-EPMC5584393 | biostudies-literature
S-EPMC8960361 | biostudies-literature
S-EPMC5492242 | biostudies-literature
GSE9734 | GEO | 2007-12-08
S-EPMC4968595 | biostudies-literature
S-EPMC5773052 | biostudies-literature
S-EPMC7789813 | biostudies-literature