Dataset Information

Structuring and extracting knowledge for the support of hypothesis generation in molecular biology.

ABSTRACT:

Background

Hypothesis generation in molecular and cellular biology is an empirical process in which knowledge derived from prior experiments is distilled into a comprehensible model. The requirement of automated support is exemplified by the difficulty of considering all relevant facts that are contained in the millions of documents available from PubMed. Semantic Web provides tools for sharing prior knowledge, while information retrieval and information extraction techniques enable its extraction from literature. Their combination makes prior knowledge available for computational analysis and inference. While some tools provide complete solutions that limit the control over the modeling and extraction processes, we seek a methodology that supports control by the experimenter over these critical processes.

Results

We describe progress towards automated support for the generation of biomolecular hypotheses. Semantic Web technologies are used to structure and store knowledge, while a workflow extracts knowledge from text. We designed minimal proto-ontologies in OWL for capturing different aspects of a text mining experiment: the biological hypothesis, text and documents, text mining, and workflow provenance. The models fit a methodology that allows focus on the requirements of a single experiment while supporting reuse and posterior analysis of extracted knowledge from multiple experiments. Our workflow is composed of services from the 'Adaptive Information Disclosure Application' (AIDA) toolkit as well as a few others. The output is a semantic model with putative biological relations, with each relation linked to the corresponding evidence.

Conclusion

We demonstrated a 'do-it-yourself' approach for structuring and extracting knowledge in the context of experimental research on biomolecular mechanisms. The methodology can be used to bootstrap the construction of semantically rich biological models using the results of knowledge extraction processes. Models specific to particular experiments can be constructed that, in turn, link with other semantic models, creating a web of knowledge that spans experiments. Mapping mechanisms can link to other knowledge resources such as OBO ontologies or SKOS vocabularies. AIDA Web Services can be used to design personalized knowledge extraction procedures. In our example experiment, we found three proteins (NF-Kappa B, p21, and Bax) potentially playing a role in the interplay between nutrients and epigenetic gene regulation.

SUBMITTER: Roos M

PROVIDER: S-EPMC2755830 | biostudies-literature | 2009 Oct

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Structuring and extracting knowledge for the support of hypothesis generation in molecular biology.

Roos Marco M Marshall M Scott MS Gibson Andrew P AP Schuemie Martijn M Meij Edgar E Katrenko Sophia S van Hage Willem Robert WR Krommydas Konstantinos K Adriaans Pieter W PW

BMC bioinformatics 20091001

<h4>Background</h4>Hypothesis generation in molecular and cellular biology is an empirical process in which knowledge derived from prior experiments is distilled into a comprehensible model. The requirement of automated support is exemplified by the difficulty of considering all relevant facts that are contained in the millions of documents available from PubMed. Semantic Web provides tools for sharing prior knowledge, while information retrieval and information extraction techniques enable its ...[more]

PMID: 19796406

Similar Datasets

Project description:BACKGROUND:Videographic material of animals can contain inapparent signals, such as color changes or motion that hold information about physiological functions, such as heart and respiration rate, pulse wave velocity, and vocalization. Eulerian video magnification allows the enhancement of such signals to enable their detection. The purpose of this study is to demonstrate how signals relevant to experimental physiology can be extracted from non-contact videographic material of animals. RESULTS:We applied Eulerian video magnification to detect physiological signals in a range of experimental models and in captive and free ranging wildlife. Neotenic Mexican axolotls were studied to demonstrate the extraction of heart rate signal of non-embryonic animals from dedicated videographic material. Heart rate could be acquired both in single and multiple animal setups of leucistic and normally colored animals under different physiological conditions (resting, exercised, or anesthetized) using a wide range of video qualities. Pulse wave velocity could also be measured in the low blood pressure system of the axolotl as well as in the high-pressure system of the human being. Heart rate extraction was also possible from videos of conscious, unconstrained zebrafish and from non-dedicated videographic material of sand lizard and giraffe. This technique also allowed for heart rate detection in embryonic chickens in ovo through the eggshell and in embryonic mice in utero and could be used as a gating signal to acquire two-phase volumetric micro-CT data of the beating embryonic chicken heart. Additionally, Eulerian video magnification was used to demonstrate how vocalization-induced vibrations can be detected in infrasound-producing Asian elephants. CONCLUSIONS:Eulerian video magnification provides a technique to extract inapparent temporal signals from videographic material of animals. This can be applied in experimental and comparative physiology where contact-based recordings (e.g., heart rate) cannot be acquired.

Dataset Information

Structuring and extracting knowledge for the support of hypothesis generation in molecular biology.

Background

Results

Conclusion

Publications

Structuring and extracting knowledge for the support of hypothesis generation in molecular biology.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets