Dataset Information

A novel approach to minimize false discovery rate in genome-wide data analysis.


ABSTRACT: BACKGROUND: High-throughput technologies, such as DNA microarrays, have significantly advanced biological and biomedical research by enabling researchers to carry out genome-wide screens. One critical task in analyzing genome-wide datasets is to control the false discovery rate (FDR) so that the proportion of false positive features among those called significant is restrained. Recently, a number of FDR control methods have been proposed and widely adopted, such as the Benjamini-Hochberg (BH) approach, the Storey approach and Significance Analysis of Microarrays (SAM). METHODS: This paper presents a straightforward yet powerful FDR control method termed miFDR, which aims to minimize the FDR when calling a fixed number of significant features. We prove theoretically that the strategy used by miFDR finds the optimal number of significant features when the desired FDR is fixed. RESULTS: We compared miFDR with the BH approach, the Storey approach and SAM on both simulated datasets and public DNA microarray datasets. The results demonstrated that miFDR outperforms the others by identifying more significant features under the same FDR cut-offs. A literature search showed that many genes called only by miFDR are indeed relevant to the underlying biology of interest. CONCLUSIONS: FDR control has been widely applied in the analysis of high-throughput datasets, allowing for rapid discoveries. Under the same FDR threshold, miFDR is capable of identifying more significant features than its competitors at a comparable level of complexity. Therefore, it can potentially have a substantial impact on biological and biomedical research. AVAILABILITY: miFDR is available from the authors on request.
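
For orientation, the Benjamini-Hochberg approach named in the abstract as a comparison baseline can be sketched as below. This is the standard BH step-up procedure, not miFDR, whose algorithm this record does not describe. The function name `benjamini_hochberg`, the `alpha` parameter and the example p-values are illustrative assumptions and do not come from the paper.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Standard BH step-up procedure (illustrative sketch, not miFDR).

    Returns a list of booleans, one per input p-value, marking the
    features called significant at FDR level alpha.
    """
    m = len(p_values)
    # Indices of the p-values in ascending order.
    order = sorted(range(m), key=lambda i: p_values[i])

    # Find the largest rank k such that p_(k) <= alpha * k / m.
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= alpha * rank / m:
            cutoff_rank = rank

    # Every feature ranked at or below the cutoff is called significant.
    significant = [False] * m
    for idx in order[:cutoff_rank]:
        significant[idx] = True
    return significant


# Made-up p-values, for illustration only.
example_p = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205, 0.212, 0.216]
calls = benjamini_hochberg(example_p, alpha=0.05)
print(sum(calls), "features called significant")
```

With these made-up inputs only the two smallest p-values pass their rank-specific thresholds (0.005 and 0.01). According to the abstract, miFDR aims to call more features than such baselines at the same FDR cut-off.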

SUBMITTER: Bei Y 

PROVIDER: S-EPMC3856609 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

Publications

A novel approach to minimize false discovery rate in genome-wide data analysis.

Bei Yuanzhe, Hong Pengyu

BMC Systems Biology, 2013-10-23


Similar Datasets

| S-EPMC4143671 | biostudies-literature
| S-EPMC4103587 | biostudies-literature
| S-EPMC8501795 | biostudies-literature
| S-EPMC3322139 | biostudies-literature
| S-EPMC3559028 | biostudies-literature
| S-EPMC4563723 | biostudies-literature
| S-EPMC6397176 | biostudies-literature
| S-EPMC3372940 | biostudies-literature
| S-EPMC3220955 | biostudies-literature
| S-EPMC8167059 | biostudies-literature