
Dataset Information


Improved data retrieval from TreeBASE via taxonomic and linguistic data enrichment.


ABSTRACT: TreeBASE, the only data repository for phylogenetic studies, is not being used effectively since it does not meet the taxonomic data retrieval requirements of the systematics community. We show, through an examination of the queries performed on TreeBASE, that data retrieval using taxon names is unsatisfactory. We report on a new wrapper supporting taxon queries on TreeBASE by utilising a Taxonomy and Classification Database (TCl-Db) we created. TCl-Db holds merged and consolidated taxonomic names from multiple data sources and can be used to translate hierarchical, vernacular and synonym queries into specific query terms in TreeBASE. The query expansion supported by TCl-Db shows a very significant improvement in information retrieval quality. The wrapper can be accessed at the URL http://spira.zoology.gla.ac.uk/app/tbasewrapper.php. The methodology we developed is scalable and can be applied to new data as they become available in the future. Significantly improved data retrieval quality is shown for all queries, and additional flexibility is achieved via user-driven taxonomy selection.
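The query expansion described in the abstract (translating hierarchical, vernacular and synonym queries into specific query terms) can be illustrated with a minimal sketch. This is not the authors' implementation and does not use the TCl-Db schema or any TreeBASE API: the dictionaries, the function names `resolve` and `expand_query`, and the sample taxa are all hypothetical, chosen only to show the general idea under the assumption that a merged taxonomy can be viewed as lookup tables.

```python
# Hypothetical toy lookup tables standing in for a merged taxonomy.
# None of this data or structure is taken from TCl-Db itself.
PARENT_TO_CHILDREN = {
    "Felidae": ["Panthera", "Felis"],
    "Panthera": ["Panthera leo", "Panthera tigris"],
    "Felis": ["Felis catus"],
}
VERNACULAR_TO_SCIENTIFIC = {"lion": "Panthera leo", "cats": "Felidae"}
SYNONYM_TO_ACCEPTED = {"Felis leo": "Panthera leo"}


def resolve(term):
    """Map a vernacular name or a synonym to its accepted scientific name."""
    # Vernacular names are matched case-insensitively; other terms pass through.
    term = VERNACULAR_TO_SCIENTIFIC.get(term.lower(), term)
    return SYNONYM_TO_ACCEPTED.get(term, term)


def expand_query(term):
    """Return the resolved name plus every descendant taxon, sorted by name."""
    stack = [resolve(term)]
    seen = set()
    while stack:
        name = stack.pop()
        if name in seen:
            continue
        seen.add(name)
        # Hierarchical expansion: follow child links down the taxonomy.
        stack.extend(PARENT_TO_CHILDREN.get(name, []))
    return sorted(seen)
```

In this sketch a vernacular query such as `expand_query("cats")` would be turned into the family name and all taxa beneath it, and each resulting name could then be submitted as a specific query term to the repository.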

SUBMITTER: Anwar N 

PROVIDER: S-EPMC2685121 | biostudies-literature | 2009 May

REPOSITORIES: biostudies-literature


Publications

Improved data retrieval from TreeBASE via taxonomic and linguistic data enrichment.

Anwar N, Hunt E

BMC Evolutionary Biology, 2009-05-08



Similar Datasets

| S-EPMC3170168 | biostudies-literature
| S-EPMC3901538 | biostudies-literature
| S-EPMC4860591 | biostudies-literature
| S-EPMC8110821 | biostudies-literature
| S-EPMC8648753 | biostudies-literature
| S-EPMC5343174 | biostudies-literature
| S-EPMC6461645 | biostudies-literature
| S-EPMC5666573 | biostudies-literature
| S-EPMC7255349 | biostudies-literature
| S-EPMC3929260 | biostudies-literature