首页> 外文会议>International Conference on Information Visualisation >Literature Visualization and Similarity Measurement Based on Citation Relations
【24h】

Literature Visualization and Similarity Measurement Based on Citation Relations

机译:基于引用关系的文献可视化与相似度度量

获取原文

摘要

While similar documents are, traditionally, found using Natural Language Processing, we observe reference/citation information by authors indicates better insight of similarity. Our system is to retrieve publications from Google Scholar and visualize them as a 2D graph using the citation relation, where the nodes represent the documents while the links represent the citation/reference relation between them. We measure the similarity score between each pair of papers based on both the number of paths and the length of each path. More paths and shorter the lengths higher the similarity score. We compared them with another similarity scores from Scurtu's Document Similarity API [1] that uses Natural Language Processing. We use the average of the similarity scores collected from 15 users as a ground truth to determine how good the scores from two methods are. The result shows that our citation network approach gives better results than the ones by Scurtu's.
机译:传统上,使用自然语言处理可以找到相似的文档,但是我们观察到作者的参考文献/引用信息表明对相似性有更好的了解。我们的系统是从Google学术搜索中检索出版物,并使用引用关系将它们可视化为2D图形,其中节点代表文档,而链接则代表它们之间的引用/引用关系。我们基于路径数和每个路径的长度来衡量每对论文之间的相似性得分。路径越多,长度越短,相似度得分越高。我们将它们与使用自然语言处理的Scurtu文档相似度API [1]中的另一个相似度分数进行了比较。我们使用从15个用户那里收集的相似性分数的平均值作为基本事实,来确定两种方法的分数有多好。结果表明,与Scurtu的方法相比,我们的引文网络方法具有更好的效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号