Unknown

Dataset Information

0

Scientist and data architect collaborate to curate and archive an inner ear electrophysiology data collection.


ABSTRACT: In the past scientists reported summaries of their findings; they did not provide their original data collections. Many stakeholders (e.g., funding agencies) are now requesting that such data be made publicly available. This mandate is being adopted to facilitate further discovery, and to mitigate waste and deficits in the research process. At the same time, the necessary infrastructure for data curation (e.g., repositories) has been evolving. The current target is to make research products FAIR (Findable, Accessible, Interoperable, Reusable), resulting in data that are curated and archived to be both human and machine compatible. However, most scientists have little training in data curation. Specifically, they are ill-equipped to annotate their data collections at a level that facilitates discoverability, aggregation, and broad reuse in a context separate from their creation or sub-field. To circumvent these deficits data architects may collaborate with scientists to transform and curate data. This paper's example of a data collection describes the electrical properties of outer hair cells isolated from the mammalian cochlea. The data is expressed with a variant of The Ontology for Biomedical Investigations (OBI), mirrored to provide the metadata and nested data architecture used within the Hierarchical Data Format version 5 (HDF5) format. Each digital specimen is displayed in a tree configuration (like directories in a computer) and consists of six main branches based on the ontology classes. The data collections, scripts, and ontological OWL file (OBI based Inner Ear Electrophysiology (OBI_IEE)) are deposited in three repositories. We discuss the impediments to producing such data collections for public use, and the tools and processes required for effective implementation. This work illustrates the impact that small collaborations can have on the curation of our publicly-funded collections, and is particularly salient for fields where data is sparse, throughput is low, and sacrifice of animals is required for discovery.

SUBMITTER: Farrell B 

PROVIDER: S-EPMC6799921 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature

altmetric image

Publications

Scientist and data architect collaborate to curate and archive an inner ear electrophysiology data collection.

Farrell Brenda B   Bengtson Jason J  

PloS one 20191018 10


In the past scientists reported summaries of their findings; they did not provide their original data collections. Many stakeholders (e.g., funding agencies) are now requesting that such data be made publicly available. This mandate is being adopted to facilitate further discovery, and to mitigate waste and deficits in the research process. At the same time, the necessary infrastructure for data curation (e.g., repositories) has been evolving. The current target is to make research products FAIR  ...[more]

Similar Datasets

| S-EPMC10899168 | biostudies-literature
2016-02-29 | E-GEOD-73829 | biostudies-arrayexpress
2016-12-31 | GSE73828 | GEO
2016-02-29 | GSE73829 | GEO
2017-04-24 | PXD003379 | Pride
2006-10-01 | GSE4965 | GEO
| S-EPMC4035935 | biostudies-literature
2012-05-04 | E-GEOD-37767 | biostudies-arrayexpress
2017-08-01 | GSE69546 | GEO
2012-05-05 | GSE37767 | GEO