Important citation identification by exploiting content and section-wise in-text citation count

Shahzad Nazir; Muhammad Asif; Shahbaz Ahmad; Faisal Bukhari; Muhammad Tanvir Afzal; Hanan Aljuaid

首页> 外文期刊>PLoS One >Important citation identification by exploiting content and section-wise in-text citation count

【24h】

Important citation identification by exploiting content and section-wise in-text citation count

机译：利用内容和文本文本引文计数的重要引文识别

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A citation is deemed as a potential parameter to determine linkage between research articles. The parameter has extensively been employed to form multifarious academic aspects like calculating the impact factor of journals, h-Index of researchers, allocate different research grants, find the latest research trends, etc. The current state-of-the-art contends that all citations are not of equal importance. Based on this argument, the current trend in citation classification community categorizes citations into important and non-important reasons. The community has proposed different approaches to extract important citations such as citation count, context-based, metadata, and textual based approaches. The contemporary state-of-the-art in citation classification community ignores significantly potential features that can play a vital role in citation classification. This research presents a novel approach for binary citation classification by exploiting section-wise in-text citation frequencies, similarity score, and overall citation count-based features. The study also introduces machine learning algorithms based novel approach for assigning appropriate weights to the logical sections of research papers. The weights are allocated to the citations with respect to their sections. To perform the classification, we used three classification techniques, Support Vector Machine, Kernel Linear Regression, and Random Forest. The experiment was performed on two annotated benchmark datasets that contain 465 and 311 citation pairs of research articles respectively. The results revealed that the proposed approach attained an improved value of precision (i.e., 0.84 vs 0.72) from contemporary state-of-the-art approach.

机译：引用被认为是确定研究文章之间联动的潜在参数。该参数广泛用于形成多种学业方面，如计算期刊的影响因素，研究人员的H-Indep，分配不同的研究拨款，找到最新的研究趋势等。目前的最先进引文并不同等重要。基于此论点，引文分类界的当前趋势将引用分类为重要且非重要原因。社区提出了不同的方法来提取重要的引文，如引文计数，基于上下文，元数据和基于文本的方法。当代的引文群落中当代最先进的界面忽略了显着的潜在功能，可以在引文分类中发挥重要作用。本研究通过剥削文本文本引文，相似度得分和基于整体引用计数的特征来提出二进制引文分类的新方法。该研究还介绍了基于机器学习算法的新方法，用于将适当的重量分配给研究论文的逻辑部分。重量与他们的部分分配给引文。要执行分类，我们使用了三种分类技术，支持向量机，内核线性回归和随机林。在两个注释的基准数据集上进行实验，分别包含465和311引用研究文章。结果表明，拟议的方法从当代最先进的方法中获得了改善的精确度（即0.84 vs 0.72）。

著录项

来源
《PLoS One》 |2020年第3期|共19页
作者
Shahzad Nazir; Muhammad Asif; Shahbaz Ahmad; Faisal Bukhari; Muhammad Tanvir Afzal; Hanan Aljuaid;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类医药、卫生;
关键词

相似文献

外文文献
中文文献
专利

1. Dimensions and Uncertainties of Author Citation Rankings: Lessons Learned From Frequency-Weighted In-Text Citation Counting [J] . Dangzhi Zhao, Andreas Strotmann Journal of the American Society for Information Science and Technology . 2016,第3期

机译：作者引文排名的维度和不确定性：频率加权文字引文计数的经验教训
2. Important citation identification using sentiment analysis of in-text citations [J] . Aljuaid Hanan, Iftikhar Rimsha, Ahmad Shahbaz, Telematics and Informatics . 2021,第Jana期

机译：重要的引文识别使用文中文本的情感分析
3. An analysis of in-text citations based on fractional counting [J] . Pak Chol Myong, Wang Weibin, Yu Guang Journal of informetrics . 2020,第4期

机译：基于分数计数的文本文本引文分析
4. Important Citation Identification by Exploiting the Optimal In-text Citation Frequency [C] . Shahzad Nazir, Muhammad Asif, Shahbaz Ahmad International Conference on Engineering and Emerging Technologies . 2020

机译：通过利用最佳的文本引用频率来识别重要的引用
5. A quantitative content analysis of in-text citations in choral pedagogy books published between 1989–2009 [D] . Jones, Sarah K. 2010

机译：1989 - 2009年间公布的合唱教学图书中文本文本的定量内容分析
6. Important citation identification by exploiting content and section-wise in-text citation count [O] . Shahzad Nazir, Muhammad Asif, Shahbaz Ahmad, 2020

机译：通过利用内容和文本文本引文计数的重要引文识别
7. Important citation identification by exploiting content and section-wise in-text citation count [O] . Shahzad Nazir, Muhammad Asif, Shahbaz Ahmad, 2020

机译：通过利用内容和文本文本引文计数的重要引文识别
8. Clicks versus Citations: Click Count as a Metric in High Energy Physics Publishing. [R] . Bitton, A. 2011

机译：点击与引文：点击计数作为高能物理出版的指标。

Important citation identification by exploiting content and section-wise in-text citation count

摘要

著录项

相似文献

相关主题

期刊订阅