首页> 外文会议>International Conference on Engineering and Emerging Technologies >Important Citation Identification by Exploiting the Optimal In-text Citation Frequency
【24h】

Important Citation Identification by Exploiting the Optimal In-text Citation Frequency

机译:利用最佳文本引文引用频率的重要引文识别

获取原文

摘要

Research is always based on previously done work. To acknowledge the worthy work of the predecessors of the field, researchers do citations. Citations are factors that are used for measuring the impact factor of journals, to rank the researchers, to find out latest research topics, for allocating research grants etc. In current epoch the research community has turned their focus towards citations and is of the view that all citations are not equally important. To find out important citations, researchers used different approaches such as context based, cue word based, metadata based, frequency based, textual based etc. Among proposed methodologies, frequency based approach was extensively used. The citation with high frequency was considered as important, but it is yet unclear that what should be the frequency cut off value of citation for considering it important. This research explored the significance of applying Threshold value over Frequency count for binary classification. We identified optimal threshold value of frequency count and further applied this to classify the citations into important and non-important ones. To evaluate the proposed approach a benchmark data set annotated by two domain experts was used that consisted of 465 citation pairs. The results were compared with state of the art precision value of 0.72. While the experiment showed increased value of 0.75 in terms of precision.
机译:研究始终基于以前完成的工作。为了承认该领域前任的有价值的工作,研究人员做了引文。引用是用于测量期刊影响因子的因素,对研究人员进行排名,找到最新的研究主题,用于分配研究拨款等。在当前时代,研究界已经将他们的重点转向引用,并认为所有引文并不同样重要。为了了解重要的引用,研究人员使用了不同的方法,例如基于的上下文,基于词的词,基于元数据,基于频率的,文本的等。在提出的方法中,基于频率的方法是广泛的使用。高频的引文被认为是重要的,但目前目前尚不清楚引文的频率切断值,以考虑重要。本研究探讨了在二进制分类中施加阈值的重要性。我们确定了频率计数的最佳阈值,并进一步应用于将引用分类为重要而非重要性。为了评估所提出的方法,使用由两个域专家注释的基准数据集,由465个引文对组成。将结果与最新的技术精度值为0.72进行比较。虽然实验在精度方面表现出0.75的增加。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号