Dataset Information

biotoolsSchema: a formalized schema for bioinformatics software description.


ABSTRACT:

Background

Life scientists routinely face massive and heterogeneous data analysis tasks and must find and access the most suitable databases or software in a jungle of web-accessible resources. The diversity of information used to describe life-scientific digital resources presents an obstacle to their utilization. Although several standardization efforts are emerging, no information schema has been sufficiently detailed to enable uniform semantic and syntactic description and cataloguing of bioinformatics resources.

Findings

Here we describe biotoolsSchema, a formalized information model that balances the needs of conciseness for rapid adoption against the provision of rich technical information and scientific context. biotoolsSchema results from a series of community-driven workshops and is deployed in the bio.tools registry, providing the scientific community with >17,000 machine-readable and human-understandable descriptions of software and other digital life-science resources. We compare our approach to related initiatives and provide alignments to foster interoperability and reusability.

Conclusions

biotoolsSchema supports the formalized, rigorous, and consistent specification of the syntax and semantics of bioinformatics resources, and enables cataloguing efforts such as bio.tools that help scientists to find, comprehend, and compare resources. The use of biotoolsSchema in bio.tools promotes the FAIRness of research software, a key element of open and reproducible developments for data-intensive sciences.

SUBMITTER: Ison J 

PROVIDER: S-EPMC7842104 | biostudies-literature | 2021 Jan

REPOSITORIES: biostudies-literature

Publications

biotoolsSchema: a formalized schema for bioinformatics software description.

Jon Ison, Hans Ienasescu, Emil Rydza, Piotr Chmura, Kristoffer Rapacki, Alban Gaignard, Veit Schwämmle, Jacques van Helden, Matúš Kalaš, Hervé Ménager

GigaScience, 2021-01-01, issue 1



Similar Datasets

S-EPMC3194197 | biostudies-literature
S-EPMC5537105 | biostudies-other
S-EPMC8771759 | biostudies-literature
S-EPMC3294241 | biostudies-literature
S-EPMC7305685 | biostudies-literature
S-EPMC7172532 | biostudies-literature
S-EPMC7031678 | biostudies-literature
S-EPMC6738188 | biostudies-literature
S-EPMC9176277 | biostudies-literature
S-EPMC5725956 | biostudies-literature