
Dataset Information


Generation of open biomedical datasets through ontology-driven transformation and integration processes.


ABSTRACT: Biomedical research usually requires combining large volumes of data from multiple heterogeneous sources, which makes the integrated exploitation of such data difficult. The Semantic Web paradigm offers a natural technological space for data integration and exploitation by generating machine-readable content. Linked Open Data is a Semantic Web initiative that promotes the publication and sharing of data in machine-readable semantic formats.

We present an approach for the transformation and integration of heterogeneous biomedical data with the objective of generating open biomedical datasets in Semantic Web formats. The transformation of the data is based on mappings between the entities of the data schema and the ontological infrastructure that provides the meaning of the content. Our approach permits different types of mappings and includes the possibility of defining complex transformation patterns. Once the mappings are defined, they can be applied automatically to datasets to generate logically consistent content, and they can be reused in further transformation processes.

The results of our research are (1) a common transformation and integration process for heterogeneous biomedical data; (2) the application of Linked Open Data principles to generate interoperable, open biomedical datasets; and (3) a software tool, called SWIT, that implements the approach. In this paper we also describe how we have applied SWIT in different biomedical scenarios and some lessons learned.

We have presented an approach that is able to generate open biomedical repositories in Semantic Web formats. SWIT is able to apply the Linked Open Data principles in the generation of the datasets, which allows their content to be linked to external repositories and so creates linked open datasets. SWIT datasets may contain data from multiple sources and schemas, thus becoming integrated datasets.
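
To make the idea of schema-to-ontology mappings concrete, here is a minimal Python sketch of a reusable mapping that turns tabular rows into RDF statements in N-Triples syntax. It does not reproduce SWIT or its mapping format, which this record does not describe. The namespaces, the `EntityMapping` class, the `Patient` class, the property names, and the sample rows are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass

# Hypothetical namespaces; a real dataset would use its own ontology IRIs.
EX = "http://example.org/resource/"
ONT = "http://example.org/ontology#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


@dataclass
class EntityMapping:
    """Maps one table of the source schema to one ontology class (illustrative only)."""
    table: str
    id_column: str
    ontology_class: str
    properties: dict  # source column name -> ontology property name


def escape_literal(value: str) -> str:
    """Escape backslashes, quotes, and newlines for an N-Triples string literal."""
    return value.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")


def row_to_triples(mapping: EntityMapping, row: dict) -> list:
    """Apply the mapping to one row, producing N-Triples lines."""
    subject = f"<{EX}{mapping.table}/{row[mapping.id_column]}>"
    triples = [f"{subject} <{RDF_TYPE}> <{ONT}{mapping.ontology_class}> ."]
    for column, prop in mapping.properties.items():
        value = row.get(column)
        if value is None:
            continue  # skip missing values rather than emit empty literals
        triples.append(f'{subject} <{ONT}{prop}> "{escape_literal(str(value))}" .')
    return triples


def transform(mapping: EntityMapping, rows: list) -> list:
    """Apply the same mapping to every row; the mapping can be reused across datasets."""
    triples = []
    for row in rows:
        triples.extend(row_to_triples(mapping, row))
    return triples


# Hypothetical mapping and sample data.
patient_mapping = EntityMapping(
    table="patient",
    id_column="id",
    ontology_class="Patient",
    properties={"name": "hasName", "birth_year": "hasBirthYear"},
)
rows = [
    {"id": 1, "name": "Jane", "birth_year": 1980},
    {"id": 2, "name": None, "birth_year": 1975},
]
triples = transform(patient_mapping, rows)
for line in triples:
    print(line)
```

Because the mapping is defined once and kept separate from the data, the same `patient_mapping` could be applied to any source that shares the schema. This sketch performs no consistency checking against an ontology, which the approach described above does provide.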

SUBMITTER: Carmen Legaz-Garcia MD 

PROVIDER: S-EPMC4891880 | biostudies-literature | 2016 Jun

REPOSITORIES: biostudies-literature


Publications

Generation of open biomedical datasets through ontology-driven transformation and integration processes.

María del Carmen Legaz-García, José Antonio Miñarro-Giménez, Marcos Menárguez-Tortosa, Jesualdo Tomás Fernández-Breis

Journal of Biomedical Semantics, 2016-06-03



Similar Datasets

| S-EPMC4448321 | biostudies-literature
| S-EPMC2646250 | biostudies-literature
| S-EPMC8591903 | biostudies-literature
| S-EPMC7378878 | biostudies-literature
| S-EPMC2762998 | biostudies-literature
| S-EPMC5018193 | biostudies-literature
| S-EPMC4851331 | biostudies-literature
| S-EPMC6971239 | biostudies-literature
| S-EPMC3621846 | biostudies-other
| S-EPMC5463318 | biostudies-literature