Characterizing and mining citation graph of computer science literature.



Abstract

Computer science literature, like many other natural systems, forms a directed graph, which we call the Citation Graph of Computer Science Literature: its nodes are articles, and its edges are links from a paper to the articles it cites. With hundreds and thousands of computer science publications appearing each year, people are increasingly interested in exploring the features hidden in such a huge directed graph using modern graph-theoretic techniques. In this study, we constructed a web robot that queries the prominent computer science digital library ResearchIndex to build citation graphs. With a citation graph of reasonable size in hand, we first verified that the in-degrees of nodes (i.e., the citation counts of articles) follow a power-law distribution. Next, we applied a series of graph-theoretic algorithms to it (Weakly Connected Component, Strongly Connected Component, Biconnected Component, Global Minimum Cut, Max-flow Min-cut, and Dijkstra's Shortest Path) and analyzed the results numerically. Our study indicates that the citation graph formed by computer science literature is very well connected and that its widespread connectivity does not depend on “hubs” and “authorities”. The experimental results also show that the macroscopic structure of the citation graph differs from that of the Web graph, which follows the Bow Tie model. Finally, we provide diameter measurements based on the citation graph built by querying ResearchIndex, which is a subset and snapshot of the whole citation graph.
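The first steps named in the abstract (in-degree distribution, weakly and strongly connected components) can be illustrated with a minimal sketch. This is not the thesis's implementation, which is not shown in this record. The function names, the edge-list representation, and the toy data `TOY_EDGES` are all assumptions made for illustration, and Kosaraju's two-pass method is used for strong components only because it is compact.

```python
from collections import Counter, defaultdict, deque

# Hypothetical toy data: each pair (citing, cited) is one citation edge.
TOY_EDGES = [("A", "B"), ("B", "C"), ("C", "B"), ("D", "C"), ("E", "F")]


def in_degree_histogram(edges):
    """Map each in-degree k to the number of articles cited exactly k times."""
    nodes = {n for edge in edges for n in edge}
    in_deg = Counter(dst for _, dst in edges)
    return Counter(in_deg[n] for n in nodes)


def weakly_connected_components(edges):
    """Components found by BFS after ignoring edge direction."""
    adj = defaultdict(set)
    for src, dst in edges:
        adj[src].add(dst)
        adj[dst].add(src)
    seen, components = set(), []
    for start in list(adj):
        if start in seen:
            continue
        seen.add(start)
        queue, comp = deque([start]), set()
        while queue:
            node = queue.popleft()
            comp.add(node)
            for nxt in adj[node]:
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append(nxt)
        components.append(comp)
    return components


def strongly_connected_components(edges):
    """Kosaraju's algorithm with explicit stacks (no recursion limit issues)."""
    fwd, rev, nodes = defaultdict(list), defaultdict(list), set()
    for src, dst in edges:
        fwd[src].append(dst)
        rev[dst].append(src)
        nodes.update((src, dst))

    # Pass 1: record nodes in order of DFS completion on the forward graph.
    order, seen = [], set()
    for root in nodes:
        if root in seen:
            continue
        seen.add(root)
        stack = [(root, iter(fwd.get(root, ())))]
        while stack:
            node, neighbours = stack[-1]
            for nxt in neighbours:
                if nxt not in seen:
                    seen.add(nxt)
                    stack.append((nxt, iter(fwd.get(nxt, ()))))
                    break
            else:  # all neighbours explored: node is finished
                order.append(node)
                stack.pop()

    # Pass 2: sweep the reversed graph in decreasing finish order.
    seen, components = set(), []
    for root in reversed(order):
        if root in seen:
            continue
        seen.add(root)
        comp, stack = set(), [root]
        while stack:
            node = stack.pop()
            comp.add(node)
            for nxt in rev.get(node, ()):
                if nxt not in seen:
                    seen.add(nxt)
                    stack.append(nxt)
        components.append(comp)
    return components
```

On a real crawl, the in-degree histogram would typically be plotted on log-log axes to check for the roughly straight line that a power-law distribution implies; the toy graph here is far too small for that and only demonstrates the mechanics.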

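Bibliographic record

  • Author

    An, Yuan

  • Author affiliation

    Dalhousie University (Canada)

  • Degree-granting institution: Dalhousie University (Canada)
  • Subject: Computer Science; Information Science
  • Degree: M.Comp.Sc.
  • Year: 2001
  • Pages: 37 p.
  • Total pages: 37
  • Original format: PDF
  • Language of text: eng
  • Chinese Library Classification: Automation technology, computer technology; information and knowledge dissemination
  • Keywords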
