Unknown

Dataset Information

0

QiSampler: evaluation of scoring schemes for high-throughput datasets using a repetitive sampling strategy on gold standards.


ABSTRACT: BACKGROUND: High-throughput biological experiments can produce a large amount of data showing little overlap with current knowledge. This may be a problem when evaluating alternative scoring mechanisms for such data according to a gold standard dataset because standard statistical tests may not be appropriate. FINDINGS: To address this problem we have implemented the QiSampler tool that uses a repetitive sampling strategy to evaluate several scoring schemes or experimental parameters for any type of high-throughput data given a gold standard. We provide two example applications of the tool: selection of the best scoring scheme for a high-throughput protein-protein interaction dataset by comparison to a dataset derived from the literature, and evaluation of functional enrichment in a set of tumour-related differentially expressed genes from a thyroid microarray dataset. CONCLUSIONS: QiSampler is implemented as an open source R script and a web server, which can be accessed at http://cbdm.mdc-berlin.de/tools/sampler/.

SUBMITTER: Fontaine JF 

PROVIDER: S-EPMC3060832 | biostudies-literature | 2011

REPOSITORIES: biostudies-literature

altmetric image

Publications

QiSampler: evaluation of scoring schemes for high-throughput datasets using a repetitive sampling strategy on gold standards.

Fontaine Jean F JF   Suter Bernhard B   Andrade-Navarro Miguel A MA  

BMC research notes 20110309


<h4>Background</h4>High-throughput biological experiments can produce a large amount of data showing little overlap with current knowledge. This may be a problem when evaluating alternative scoring mechanisms for such data according to a gold standard dataset because standard statistical tests may not be appropriate.<h4>Findings</h4>To address this problem we have implemented the QiSampler tool that uses a repetitive sampling strategy to evaluate several scoring schemes or experimental parameter  ...[more]

Similar Datasets

| S-EPMC4165770 | biostudies-literature
| S-EPMC1978211 | biostudies-literature
| S-EPMC8319594 | biostudies-literature
| S-EPMC4012149 | biostudies-literature
| S-EPMC5468358 | biostudies-literature
| S-EPMC9801984 | biostudies-literature
| S-EPMC2714558 | biostudies-literature
| S-EPMC2553011 | biostudies-literature
| S-EPMC3356839 | biostudies-other
| S-EPMC3626508 | biostudies-literature