Unknown

Dataset Information

0

Selection-Corrected Statistical Inference for Region Detection With High-Throughput Assays.


ABSTRACT: Scientists use high-dimensional measurement assays to detect and prioritize regions of strong signal in spatially organized domain. Examples include finding methylation enriched genomic regions using microarrays, and active cortical areas using brain-imaging. The most common procedure for detecting potential regions is to group neighboring sites where the signal passed a threshold. However, one needs to account for the selection bias induced by this procedure to avoid diminishing effects when generalizing to a population. This paper introduces pin-down inference, a model and an inference framework that permit population inference for these detected regions. Pin-down inference provides non-asymptotic point and confidence interval estimators for the mean effect in the region that account for local selection bias. Our estimators accommodate non-stationary covariances that are typical of these data, allowing researchers to better compare regions of different sizes and correlation structures. Inference is provided within a conditional one-parameter exponential family per region, with truncations that match the selection constraints. A secondary screening-and-adjustment step allows pruning the set of detected regions, while controlling the false-coverage rate over the reported regions. We apply the method to genomic regions with differing DNA-methylation rates across tissue. Our method provides superior power compared to other conditional and non-parametric approaches.

SUBMITTER: Benjamini Y 

PROVIDER: S-EPMC9615469 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature

altmetric image

Publications

Selection-Corrected Statistical Inference for Region Detection With High-Throughput Assays.

Benjamini Yuval Y   Taylor Jonathan J   Irizarry Rafael A RA  

Journal of the American Statistical Association 20181113 527


Scientists use high-dimensional measurement assays to detect and prioritize regions of strong signal in spatially organized domain. Examples include finding methylation enriched genomic regions using microarrays, and active cortical areas using brain-imaging. The most common procedure for detecting potential regions is to group neighboring sites where the signal passed a threshold. However, one needs to account for the selection bias induced by this procedure to avoid diminishing effects when ge  ...[more]

Similar Datasets

| S-EPMC10614806 | biostudies-literature
| S-EPMC10971673 | biostudies-literature
| S-EPMC5299465 | biostudies-literature
| S-EPMC6477983 | biostudies-literature
| S-EPMC2722027 | biostudies-literature
| S-EPMC7712650 | biostudies-literature
| S-EPMC6145075 | biostudies-literature
| S-EPMC4379180 | biostudies-literature
| S-EPMC7423900 | biostudies-literature
| S-EPMC7327253 | biostudies-literature