首页> 外文期刊>Journal of informetrics >Mining Temporal Evolution of Knowledge Graphs and Genealogical Features for Literature-based Discovery Prediction
【24h】

Mining Temporal Evolution of Knowledge Graphs and Genealogical Features for Literature-based Discovery Prediction

机译:基于文学的发现预测的知识图和谱系特征的挖掘时间演变

获取原文
获取原文并翻译 | 示例
       

摘要

Literature-based discovery process identifies the important but implicit relations among information embedded in published literature. Existing techniques from Information Retrieval (IR) and Natural Language Processing (NLP) attempt to identify the hidden or unpublished connections between information concepts within published literature, however, these techniques overlooked the concept of predicting the future and emerging relations among scientific knowledge components such as author selected keywords encapsulated within the literature. Keyword Co-occurrence Network (KCN), built upon author selected keywords, is considered as a knowledge graph that focuses both on these knowledge components and knowledge structure of a scientific domain by examining the relationships between knowledge entities. Using data from two multidisciplinary research domains other than the bio-medical domain, and capitalizing on bibliometrics, the dynamicity of temporal KCNs, and a recurrent neural network, this study develops some novel features supportive for the prediction of the future literature-based discoveries - the emerging connections (co-appearances in the same article) among keywords. Temporal importance extracted from both bipartite and unipartite networks, communities defined by genealogical relations, and the relative importance of temporal citation counts were used in the feature construction process. Both node and edge-level features were input into a recurrent neural network to forecast the feature values and predict the future relations between different scientific concepts/topics represented by the author selected keywords. High performance rates, compared both against contemporary heterogeneous network-based method and preferential attachment process, suggest that these features complement both the prediction of future literature-based discoveries and emerging trend analysis. (C) 2020 Elsevier Ltd. All rights reserved.
机译:基于文学的发现过程识别出版文献中嵌入的信息之间的重要而隐含的关系。来自信息检索(IR)和自然语言处理(NLP)的现有技术试图识别出版文献中的信息概念之间的隐藏或未发布的连接,然而,这些技术忽略了预测科学知识组件之间未来和新兴关系的概念,如作者所选关键字封装在文献中。在作者所选关键字时构建的关键字共同发生网络(KCN)被视为通过检查知识实体之间的关系来专注于科学域的这些知识组件和知识结构。使用来自生物医学领域以外的两个多学科研究领域的数据,以及资本化的书法测量学,颞kCNs的动力学以及经常性的神经网络,这项研究开发了一些新颖的特征,支持预测未来的基于文学的发现 - 关键词中的新兴连接(同一条中的共同出场)。从二分和单一网络中提取的时间重要性,由谱系关系定义的社区,以及时间引用计数的相对重要性在特征施工过程中使用。将节点和边缘级别功能均输入到经常性神经网络中,以预测特征值并预测作者所选关键字所代表的不同科学概念/主题之间的未来关系。对当代异构网络的方法和优先附着过程相比,高性能率涉及这些功能,这些功能补充了未来文学的发现和新兴趋势分析的预测。 (c)2020 elestvier有限公司保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号