Identifying Important Citations Using Contextual Information from Full Text

机译：使用全文中的上下文信息识别重要的引文

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we address the problem of classifying cited work into important and non-important to the developments presented in a research publication. This task is vital for the algorithmic techniques that detect and follow emerging research topics and to qualitatively measure the impact of publications in increasingly growing scholarly big data. We consider cited work as important to a publication if that work is used or extended in some way. If a reference is cited as background work or for the purpose of comparing results, the cited work is considered to be non-important. By employing five classification techniques (Support Vector Machine, Naïve Bayes, Decision Tree, K-Nearest Neighbors and Random Forest) on an annotated dataset of 465 citations, we explore the effectiveness of eight previously published features and six novel features (including context based, cue words based and textual based). Within this set, our new features are among the best performing. Using the Random Forest classifier we achieve an overall classification accuracy of 0.91 AUC.

机译：在本文中，我们解决了将引用的工作分类为对研究出版物中提出的发展重要且不重要的问题。这项任务对于检测和跟踪新兴研究主题并定性评估出版物在日益增长的学术大数据中的影响的算法技术至关重要。如果以某种方式使用或扩展引用的作品，我们认为该作品对出版物很重要。如果引用参考文献作为背景工作或出于比较结果的目的，则认为所引用的工作不重要。通过在465条引用的带注释的数据集上采用五种分类技术（支持向量机，朴素贝叶斯，决策树，K最近邻和随机森林），我们探索了八种先前发布的功能和六种新颖功能（包括基于上下文，基于提示词和基于文本）。在这个组合中，我们的新功能是性能最好的。使用随机森林分类器，我们可以实现0.91 AUC的总体分类精度。

著录项

来源
《2017 ACM/IEEE Joint Conference on Digital Libraries》|2017年|1-8|共8页
会议地点 Toronto(CA)
作者
Saeed-Ul Hassan; Anam Akram; Peter Haddawy;
展开▼
作者单位

Dept. of Comput. Sci., Inf. Technol. Univ., Lahore, Pakistan;

Dept. of Comput. Sci., Inf. Technol. Univ., Lahore, Pakistan;

Fac. of ICT, Mahidol Univ. Salaya, Nakhonpathom, Thailand;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Training; Context modeling; Feature extraction; Classification algorithms; Support vector machines; Computer science; Information technology;

机译：培训;上下文建模;特征提取;分类算法;支持向量机;计算机科学;信息技术;

相似文献

外文文献
中文文献
专利

1. Original Research - Special Collection: Qumran Texts Habakkuk 2:5a: Denouncing ?￠????wine?￠???? or ?￠????wealth?￠????? Contextual readings of the Masoretic text and 1QpHab Crossref Citations [J] . Gert Prinsloo HTS Teologiese Studies/Theological Studies . 2016,第4期

机译：原始研究-特殊收藏：Qumran Texts哈巴谷书2：5a：放弃酒吗？还是？￠ ???? wealth？￠ ?????? Masoretic文本和1QpHab Crossref引文的上下文阅读
2. Identifying Scientific Project-generated Data Citation from Full-text Articles: An Investigation of TCGA Data Citation [J] . Jiao Li, Si Zheng, Hongyu Kang, Journal of Data and Information Science . 2017,第2期

机译：从全文文章中识别科学项目生成的数据引用：TCGA数据引用的调查
3. Identifying Scientific Project-generated Data Citation from Full-text Articles: An Investigation of TCGA Data Citation [J] . Jiao Li, Si Zheng, Hongyu Kang, Journal of Data and Information Science . 2016,第2期

机译：从全文文章中识别科学项目生成的数据引用：TCGA数据引用的调查
4. Identifying Important Citations Using Contextual Information from Full Text [C] . Saeed-Ul Hassan, Anam Akram, Peter Haddawy ACM/IEEE Joint Conference on Digital Libraries . 2017

机译：使用来自全文中的上下文信息识别重要引用
5. Citation handling: Processing citation texts in scientific documents. [D] . Whidby, Michael Alan. 2012

机译：引文处理：处理科学文献中的引文。
6. Important citation identification by exploiting content and section-wise in-text citation count [O] . Shahzad Nazir, Muhammad Asif, Shahbaz Ahmad, 2020

机译：通过利用内容和文本文本引文计数的重要引文识别
7. PDF (40 K) View thumbnail images View full size images Add to my quick links Cited by E-mail article Save as citation alert Export citation + link Set up a citation RSS feed (Opens new window) Related Articles in ScienceDirect Contents of volume 154 Physics of The Earth and Planetary Interiors Close You are entitled to access the full text of this document Contents of volume 154 Physics of The Earth and Planetary Interiors, Volume 154, Issues 3-4, 16 March 2006, Pages 350-351 PDF (25 K) Special issue contents page Physics of The Earth and Planetary Interiors Close You are entitled to access the full text of this document Special issue contents page Physics of The Earth and Planetary Interiors, Volume 154, Issues 3-4, 16 March 2006, Page iv PDF (22 K) View More Related Articles Bookmark and share in 2collab (opens in new window) Request permission to reuse this article View Record in Scopus Cited By in Scopus (0) doi:10.1016/j.pepi.2005.12.002 How to Cite or Link Using DOI (Opens New Window) Copyright © 2006 Elsevier B.V. All rights reserved. Preface [O] . Lagroix France, Muxworthy Adrian, Hoffmann Viktor 2006

机译：PDF（40 K）查看缩略图查看全尺寸图像添加到我的快速链接被电子邮件引用引用另存为引用警报导出引用+链接设置引用RSS提要（打开新窗口）ScienceDirect中的相关文章第154卷的内容地球和行星内部物理学您有权访问本文档的全文。第154卷的内容2006年3月16日，第154卷，第3-4期，第154卷，第350-351页PDF（25 K）特刊内容页地球和行星内饰关闭您有权访问本文档的全文特别发行内容页面地球与行星内饰物理，第154卷，第3-4期，2006年3月16日，第iv PDF（22 K）查看更多相关文章在2collab中添加书签并共享（在新窗口中打开）请求重新使用本文的权限在Scopus中查看记录在Scopus中被引用（0）doi：10.1016 / j.pepi.2005.12.002如何使用DOI进行引用或链接（打开新窗口）版权所有©2006 Elsevier B .V。保留所有权利。前言
8. Corpus and Method for Identifying Citations in Non-Academic Text (Open Access, Publisher's Version). [R] . He, Y., Meyers, A. 2014

机译：在非学术文本中识别引文的语料库和方法（开放存取，出版商版）。

Identifying Important Citations Using Contextual Information from Full Text

摘要

著录项

相似文献

相关主题

期刊订阅