Dataset Information

Re-sampling strategy to improve the estimation of number of null hypotheses in FDR control under strong correlation structures.

ABSTRACT:

Background

When conducting multiple hypothesis tests, it is important to control the number of false positives, or the False Discovery Rate (FDR). However, there is a tradeoff between controlling FDR and maximizing power. Several methods have been proposed, such as the q-value method, to estimate the proportion of true null hypothesis among the tested hypotheses, and use this estimation in the control of FDR. These methods usually depend on the assumption that the test statistics are independent (or only weakly correlated). However, many types of data, for example microarray data, often contain large scale correlation structures. Our objective was to develop methods to control the FDR while maintaining a greater level of power in highly correlated datasets by improving the estimation of the proportion of null hypotheses.

Results

We showed that when strong correlation exists among the data, which is common in microarray datasets, the estimation of the proportion of null hypotheses could be highly variable resulting in a high level of variation in the FDR. Therefore, we developed a re-sampling strategy to reduce the variation by breaking the correlations between gene expression values, then using a conservative strategy of selecting the upper quartile of the re-sampling estimations to obtain a strong control of FDR.

Conclusion

With simulation studies and perturbations on actual microarray datasets, our method, compared to competing methods such as q-value, generated slightly biased estimates on the proportion of null hypotheses but with lower mean square errors. When selecting genes with controlling the same FDR level, our methods have on average a significantly lower false discovery rate in exchange for a minor reduction in the power.

SUBMITTER: Lu X

PROVIDER: S-EPMC1890303 | biostudies-literature | 2007 May

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Re-sampling strategy to improve the estimation of number of null hypotheses in FDR control under strong correlation structures.

Lu Xin X Perkins David L DL

BMC bioinformatics 20070518

<h4>Background</h4>When conducting multiple hypothesis tests, it is important to control the number of false positives, or the False Discovery Rate (FDR). However, there is a tradeoff between controlling FDR and maximizing power. Several methods have been proposed, such as the q-value method, to estimate the proportion of true null hypothesis among the tested hypotheses, and use this estimation in the control of FDR. These methods usually depend on the assumption that the test statistics are ind ...[more]

PMID: 17509157

Similar Datasets

Project description:Background: With the increasing use of mycophenolic acid (MPA) formulations in organ transplantation, the need for personalized immunosuppressive therapy has become well recognized based on therapeutic drug monitoring (TDM) for avoidance of drug-related toxicity while maintaining efficacy. Few studies have assessed area under the 12 h concentration-time curve of MPA (MPA-AUC0-12h) in heart transplant recipients who received mycophenolate mofetil (MMF) dispersible tablets (MMFdt). The aim of the study was to investigate the pharmacokinetics (PK) of MMFdt combined with tacrolimus and further to develop a practical method for estimation of MPA-AUC0-12h using a limited sampling strategy (LSS). Methods: A prospective study in a single center was performed in patients who continuously administrated with MMFdt or MMF capsule (MMFc) for at least 7 days after cardiac transplantation from 2018 to 2020. A total of 48 Chinese adult heart transplant recipients were enrolled. Blood samples were collected before and 0.5, 1, 1.5, 2, 4, 6, 8, 10 and 12 h after MMF administration. The validated high-performance liquid chromatography combined with tandem mass spectrometry method was used to measure MPA concentrations. Non-compartmental pharmacokinetic (PK) analysis was applied to calculate the data obtained from individual recipients by WinNonlin. LSS models were developed for MPA-AUC0-12h prediction with multivariate stepwise regression analysis. Results: A large inter-individual variability was observed in AUC0-12h, Tmax, Cmax, MRT0-12h, t1/2 and CL/F after multiple dosing of MMFdt. However, no significant differences were observed between main PK parameters of MMFdt and MMFc. The best estimation of MPA-AUC0-12h was achieved with four points: MPA-AUC0-12h = 8.424 + 0.781 × C0.5 + 1.263 × C2 + 1.660 × C4 + 3.022 × C6 (R 2 = 0.844). The mean prediction error (MPE) and mean absolute prediction error (MAPE) of MPA-AUC0-12h were 2.09 ± 14.05% and 11.17 ± 8.52%, respectively. Both internal and external validations showed good applicability for four-point LSS equation. Conclusion: The results provide strong evidence for the use of LSS model other than a single time-point concentration of MPA when performing TDM. A four-point LSS equation using the concentrations at 0.5, 2, 4, 6 h is recommended to estimate MPA-AUC0-12h during early period after transplantation in Chinese adult heart transplant recipients receiving MMFdt or MMFc. However, proper internal and external validations with more patients should be conducted in the future.

Dataset Information

Re-sampling strategy to improve the estimation of number of null hypotheses in FDR control under strong correlation structures.

Background

Results

Conclusion

Publications

Re-sampling strategy to improve the estimation of number of null hypotheses in FDR control under strong correlation structures.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets