Dataset Information

Important citation identification by exploiting content and section-wise in-text citation count.

ABSTRACT: A citation is deemed as a potential parameter to determine linkage between research articles. The parameter has extensively been employed to form multifarious academic aspects like calculating the impact factor of journals, h-Index of researchers, allocate different research grants, find the latest research trends, etc. The current state-of-the-art contends that all citations are not of equal importance. Based on this argument, the current trend in citation classification community categorizes citations into important and non-important reasons. The community has proposed different approaches to extract important citations such as citation count, context-based, metadata, and textual based approaches. The contemporary state-of-the-art in citation classification community ignores significantly potential features that can play a vital role in citation classification. This research presents a novel approach for binary citation classification by exploiting section-wise in-text citation frequencies, similarity score, and overall citation count-based features. The study also introduces machine learning algorithms based novel approach for assigning appropriate weights to the logical sections of research papers. The weights are allocated to the citations with respect to their sections. To perform the classification, we used three classification techniques, Support Vector Machine, Kernel Linear Regression, and Random Forest. The experiment was performed on two annotated benchmark datasets that contain 465 and 311 citation pairs of research articles respectively. The results revealed that the proposed approach attained an improved value of precision (i.e., 0.84 vs 0.72) from contemporary state-of-the-art approach.

SUBMITTER: Nazir S

PROVIDER: S-EPMC7058319 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Important citation identification by exploiting content and section-wise in-text citation count.

Nazir Shahzad S Asif Muhammad M Ahmad Shahbaz S Bukhari Faisal F Afzal Muhammad Tanvir MT Aljuaid Hanan H

PloS one 20200305 3

A citation is deemed as a potential parameter to determine linkage between research articles. The parameter has extensively been employed to form multifarious academic aspects like calculating the impact factor of journals, h-Index of researchers, allocate different research grants, find the latest research trends, etc. The current state-of-the-art contends that all citations are not of equal importance. Based on this argument, the current trend in citation classification community categorizes c ...[more]

PMID: 32134940

Dataset Information

Important citation identification by exploiting content and section-wise in-text citation count.

Publications

Important citation identification by exploiting content and section-wise in-text citation count.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Database citation in full text biomedical articles.
| S-EPMC3667078 | biostudies-literature

Using citation networks to evaluate the impact of text length on keyword extraction.
| S-EPMC10681196 | biostudies-literature

Database citation in supplementary data linked to Europe PubMed Central full text biomedical articles.
| S-EPMC4363206 | biostudies-literature

Position-wise binding preference is important for miRNA target site prediction.
| S-EPMC8453239 | biostudies-literature

Computational Modeling of Stereotype Content in Text.
| S-EPMC9063736 | biostudies-literature

Table to text generation with accurate content copying.
| S-EPMC8611016 | biostudies-literature

Librarian involvement on knowledge synthesis articles and its relationship to article citation count and Journal Impact Factor.
| S-EPMC11881647 | biostudies-literature

Minimal clinically important difference for daily pedometer step count in COPD.
| S-EPMC7983253 | biostudies-literature

Exploiting Rye in Wheat Quality Breeding: The Case of Arabinoxylan Content.
| S-EPMC9965444 | biostudies-literature