Dataset Information

Genome-wide identification of significant aberrations in cancer genome.

ABSTRACT:

Background

Somatic Copy Number Alterations (CNAs) in human genomes are present in almost all human cancers. Systematic efforts to characterize such structural variants must effectively distinguish significant consensus events from random background aberrations. Here we introduce Significant Aberration in Cancer (SAIC), a new method for characterizing and assessing the statistical significance of recurrent CNA units. Three main features of SAIC include: (1) exploiting the intrinsic correlation among consecutive probes to assign a score to each CNA unit instead of single probes; (2) performing permutations on CNA units that preserve correlations inherent in the copy number data; and (3) iteratively detecting Significant Copy Number Aberrations (SCAs) and estimating an unbiased null distribution by applying an SCA-exclusive permutation scheme.

Results

We test and compare the performance of SAIC against four peer methods (GISTIC, STAC, KC-SMART, CMDS) on a large number of simulation datasets. Experimental results show that SAIC outperforms peer methods in terms of larger area under the Receiver Operating Characteristics curve and increased detection power. We then apply SAIC to analyze structural genomic aberrations acquired in four real cancer genome-wide copy number data sets (ovarian cancer, metastatic prostate cancer, lung adenocarcinoma, glioblastoma). When compared with previously reported results, SAIC successfully identifies most SCAs known to be of biological significance and associated with oncogenes (e.g., KRAS, CCNE1, and MYC) or tumor suppressor genes (e.g., CDKN2A/B). Furthermore, SAIC identifies a number of novel SCAs in these copy number data that encompass tumor related genes and may warrant further studies.

Conclusions

Supported by a well-grounded theoretical framework, SAIC has been developed and used to identify SCAs in various cancer copy number data sets, providing useful information to study the landscape of cancer genomes. Open-source and platform-independent SAIC software is implemented using C++, together with R scripts for data formatting and Perl scripts for user interfacing, and it is easy to install and efficient to use. The source code and documentation are freely available at http://www.cbil.ece.vt.edu/software.htm.

SUBMITTER: Yuan X

PROVIDER: S-EPMC3428679 | biostudies-literature | 2012 Jul

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Genome-wide identification of significant aberrations in cancer genome.

Yuan Xiguo X Yu Guoqiang G Hou Xuchu X Shih Ie-Ming IeM Clarke Robert R Zhang Junying J Hoffman Eric P EP Wang Roger R RR Zhang Zhen Z Wang Yue Y

BMC genomics 20120727

<h4>Background</h4>Somatic Copy Number Alterations (CNAs) in human genomes are present in almost all human cancers. Systematic efforts to characterize such structural variants must effectively distinguish significant consensus events from random background aberrations. Here we introduce Significant Aberration in Cancer (SAIC), a new method for characterizing and assessing the statistical significance of recurrent CNA units. Three main features of SAIC include: (1) exploiting the intrinsic correl ...[more]

PMID: 22839576

Dataset Information

Genome-wide identification of significant aberrations in cancer genome.

Background

Results

Conclusions

Publications

Genome-wide identification of significant aberrations in cancer genome.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Genome-wide identification of somatic aberrations from paired normal-tumor samples.
| S-EPMC3907544 | biostudies-literature

Genome-wide mutational spectra analysis reveals significant cancer-specific heterogeneity.
| S-EPMC4515826 | biostudies-literature

AISAIC: a software suite for accurate identification of significant aberrations in cancers.
| S-EPMC3904524 | biostudies-literature

Genome-wide genetic and epigenetic analyses of pancreatic acinar cell carcinomas reveal aberrations in genome stability.
| S-EPMC5673892 | biostudies-other

Genome-wide significant loci for addiction and anxiety.
| S-EPMC5483998 | biostudies-literature

Genome-Wide Computational Identification of Biologically Significant Cis-Regulatory Elements and Associated Transcription Factors from Rice.
| S-EPMC6918188 | biostudies-literature

Noninvasive detection of cancer-associated genome-wide hypomethylation and copy number aberrations by plasma DNA bisulfite sequencing.
| S-EPMC3839703 | biostudies-literature

Genome-wide significant association between a sequence variant at 15q15.2 and lung cancer risk.
| S-EPMC3077097 | biostudies-literature

Parametric Linkage Analysis Identifies Five Novel Genome-Wide Significant Loci for Familial Lung Cancer.
| S-EPMC8459795 | biostudies-literature

Evaluation of significant genome-wide association studies risk - SNPs in young breast cancer patients.
| S-EPMC6534300 | biostudies-literature