Unknown

Dataset Information

0

Learning to rank-based gene summary extraction.


ABSTRACT: BACKGROUND: In recent years, the biomedical literature has been growing rapidly. These articles provide a large amount of information about proteins, genes and their interactions. Reading such a huge amount of literature is a tedious task for researchers to gain knowledge about a gene. As a result, it is significant for biomedical researchers to have a quick understanding of the query concept by integrating its relevant resources. METHODS: In the task of gene summary generation, we regard automatic summary as a ranking problem and apply the method of learning to rank to automatically solve this problem. This paper uses three features as a basis for sentence selection: gene ontology relevance, topic relevance and TextRank. From there, we obtain the feature weight vector using the learning to rank algorithm and predict the scores of candidate summary sentences and obtain top sentences to generate the summary. RESULTS: ROUGE (a toolkit for summarization of automatic evaluation) was used to evaluate the summarization result and the experimental results showed that our method outperforms the baseline techniques. CONCLUSIONS: According to the experimental result, the combination of three features can improve the performance of summary. The application of learning to rank can facilitate the further expansion of features for measuring the significance of sentences.

SUBMITTER: Shang Y 

PROVIDER: S-EPMC4243090 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications

Learning to rank-based gene summary extraction.

Shang Yue Y   Hao Huihui H   Wu Jiajin J   Lin Hongfei H  

BMC bioinformatics 20141106


<h4>Background</h4>In recent years, the biomedical literature has been growing rapidly. These articles provide a large amount of information about proteins, genes and their interactions. Reading such a huge amount of literature is a tedious task for researchers to gain knowledge about a gene. As a result, it is significant for biomedical researchers to have a quick understanding of the query concept by integrating its relevant resources.<h4>Methods</h4>In the task of gene summary generation, we  ...[more]

Similar Datasets

| S-EPMC3866120 | biostudies-literature
| S-EPMC10724228 | biostudies-literature
| S-EPMC6504107 | biostudies-literature
| S-EPMC11340143 | biostudies-literature
| S-EPMC4972358 | biostudies-literature
| S-EPMC6084606 | biostudies-literature
| S-EPMC8489107 | biostudies-literature
| S-EPMC3212578 | biostudies-literature
| S-EPMC5154690 | biostudies-literature
| S-EPMC4987638 | biostudies-literature