首页> 外文会议>Pacific-Asia conference on knowledge discovery and data mining >Mining Text Enriched Heterogeneous Citation Networks
【24h】

Mining Text Enriched Heterogeneous Citation Networks

机译:挖掘文本丰富的异构引用网络

获取原文

摘要

The paper presents an approach to mining text enriched heterogeneous information networks, applied to a task of categorizing papers from a large citation network of scientific publications in the field of psychology. The methodology performs network propositionalization by calculating structural context vectors from homogeneous networks, extracted from the original network. The classifier is constructed from a table of structural context vectors, enriched with the bag-of-words vectors calculated from individual paper abstracts. A series of experiments was performed to examine the impact of increasing the number of publications in the network, and adding different types of structural context vectors. The results indicate that increasing the network size and combining both types of information is beneficial for improving the accuracy of paper categorization.
机译:本文提出了一种挖掘文本丰富的异构信息网络的方法,该方法适用于对来自心理学领域大型科学出版物引文网络中的论文进行分类的任务。该方法通过计算从原始网络提取的同构网络的结构上下文向量来执行网络命题化。分类器是根据结构上下文向量表构建的,该表丰富了根据各个论文摘要计算出的词袋向量。进行了一系列实验,以检查增加网络中的出版物数量以及添加不同类型的结构上下文向量的影响。结果表明,增加网络规模并结合两种类型的信息有利于提高纸张分类的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号