Unknown

Dataset Information

0

Opportunities for text mining in the FlyBase genetic literature curation workflow.


ABSTRACT: FlyBase is the model organism database for Drosophila genetic and genomic information. Over the last 20 years, FlyBase has had to adapt and change to keep abreast of advances in biology and database design. We are continually looking for ways to improve curation efficiency and efficacy. Genetic literature curation focuses on the extraction of genetic entities (e.g. genes, mutant alleles, transgenic constructs) and their associated phenotypes and Gene Ontology terms from the published literature. Over 2000 Drosophila research articles are now published every year. These articles are becoming ever more data-rich and there is a growing need for text mining to shoulder some of the burden of paper triage and data extraction. In this article, we describe our curation workflow, along with some of the problems and bottlenecks therein, and highlight the opportunities for text mining. We do so in the hope of encouraging the BioCreative community to help us to develop effective methods to mine this torrent of information. DATABASE URL: http://flybase.org

SUBMITTER: McQuilton P 

PROVIDER: S-EPMC3500518 | biostudies-literature | 2012

REPOSITORIES: biostudies-literature

altmetric image

Publications

Opportunities for text mining in the FlyBase genetic literature curation workflow.

McQuilton Peter P  

Database : the journal of biological databases and curation 20121117


FlyBase is the model organism database for Drosophila genetic and genomic information. Over the last 20 years, FlyBase has had to adapt and change to keep abreast of advances in biology and database design. We are continually looking for ways to improve curation efficiency and efficacy. Genetic literature curation focuses on the extraction of genetic entities (e.g. genes, mutant alleles, transgenic constructs) and their associated phenotypes and Gene Ontology terms from the published literature.  ...[more]

Similar Datasets

| S-EPMC5130168 | biostudies-literature
| S-EPMC4457984 | biostudies-literature
| S-EPMC6917032 | biostudies-literature
| S-EPMC3629079 | biostudies-literature
| S-EPMC3515862 | biostudies-literature
| S-EPMC1539029 | biostudies-literature
| S-EPMC6191643 | biostudies-literature
| S-EPMC6891984 | biostudies-literature
| S-EPMC7078066 | biostudies-literature
| S-EPMC2719631 | biostudies-literature