Dataset Information

Codabench: Flexible, easy-to-use, and reproducible meta-benchmark platform


ABSTRACT:

Summary: Obtaining a standardized benchmark of computational methods is a major issue in data-science communities. Dedicated frameworks enabling fair benchmarking in a unified environment are yet to be developed. Here, we introduce Codabench, a meta-benchmark platform that is open sourced and community driven for benchmarking algorithms or software agents versus datasets or tasks. A public instance of Codabench is open to everyone free of charge and allows benchmark organizers to fairly compare submissions under the same setting (software, hardware, data, algorithms), with custom protocols and data formats. Codabench has unique features facilitating easy organization of flexible and reproducible benchmarks, such as the possibility of reusing templates of benchmarks and supplying compute resources on demand. Codabench has been used internally and externally on various applications, receiving more than 130 users and 2,500 submissions. As illustrative use cases, we introduce four diverse benchmarks covering graph machine learning, cancer heterogeneity, clinical diagnosis, and reinforcement learning.

Highlights:
• Codabench facilitates flexible, easy, and reproducible benchmarking
• Organizers can customize benchmark design and submission format
• Organizers may host their own platform instance or use the public instance
• Four use cases in diverse domains are introduced to demonstrate the key features

The bigger picture: In almost all communities working on data science, researchers face increasingly severe issues of reproducibility and fair comparison. Researchers work on their own version of hardware/software environment, code, and data, and consequently, the published results are hardly comparable. We introduce Codabench, a meta-benchmark platform, that is capable of flexible and easy benchmarking and supports reproducibility. Codabench is an important step toward benchmarking and reproducible research. It has been used in various communities including graph machine learning, cancer heterogeneity, clinical diagnosis, and reinforcement learning. Codabench is ready to help trendy research, e.g., artificial intelligence (AI) for science and data-centric AI.

Fair and flexible benchmarking is a common issue in data-science communities. We develop the Codabench platform for flexible, easy, and reproducible benchmarking. It is open sourced and community driven. With Codabench, we are able to fairly and easily compare algorithms as well as datasets under diverse protocols. The reproducibility is also guaranteed.

SUBMITTER: Xu Z 

PROVIDER: S-EPMC9278500 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC9016988 | biostudies-literature
2019-01-21 | GSE114720 | GEO
| S-EPMC8028061 | biostudies-literature
| S-EPMC6431032 | biostudies-literature
| S-EPMC10027429 | biostudies-literature
| S-EPMC9642353 | biostudies-literature
| S-EPMC6571143 | biostudies-literature
| S-EPMC6992939 | biostudies-literature
| S-EPMC7520042 | biostudies-literature
2013-07-01 | E-GEOD-45860 | biostudies-arrayexpress