Dataset Information

Preserving sequence annotations across reference sequences.


ABSTRACT:

BACKGROUND: Matching and comparing sequence annotations of different reference sequences is vital to genomics research, yet many annotation formats do not specify the reference sequence types or versions used. This makes the integration of annotations from different sources difficult and error-prone.

RESULTS: As part of our effort to create linked data for interoperable sequence annotations, we present an RDF data model for sequence annotation using the ontological framework established by the OBO Foundry ontologies and the Basic Formal Ontology (BFO). We defined reference sequences as the common domain of integration for sequence annotations, and identified three semantic relationships between sequence annotations. In doing so, we created the Reference Sequence Annotation to compensate for gaps in the Sequence Ontology (SO) and in its mapping to BFO, particularly for annotations that refer to versions of consensus reference sequences. Moreover, we present three integration models for sequence annotations using different reference assemblies.

CONCLUSIONS: We demonstrated a working example of a sequence annotation instance, and how this instance can be linked to other annotations on different reference sequences. Sequence annotations in this format are semantically rich and can be integrated easily with different assemblies. We also identify other challenges of modeling reference sequences with the BFO.

SUBMITTER: Tatum Z 

PROVIDER: S-EPMC4108922 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

Publications

Preserving sequence annotations across reference sequences.

Tatum Z, Roos M, Gibson AP, Taschner PEM, Thompson M, Schultes EA, Laros JFJ

Journal of Biomedical Semantics, 2014-06-03, Suppl 1, Proceedings of the Bio-Ontologies Spec Interest


Similar Datasets

| S-EPMC7316827 | biostudies-literature
| S-EPMC5963392 | biostudies-literature
| S-EPMC7296393 | biostudies-literature
| S-EPMC4384018 | biostudies-literature
| S-EPMC5662012 | biostudies-literature
| S-EPMC7336184 | biostudies-literature
| S-EPMC430176 | biostudies-literature
| S-EPMC5663517 | biostudies-literature
| S-EPMC3110597 | biostudies-literature
| S-EPMC8888255 | biostudies-literature