首页> 外文会议>Pacific-Asia Conference on Knowledge Discovery and Data Mining >Connecting the Dots: Hypotheses Generation by Leveraging Semantic Shifts
【24h】

Connecting the Dots: Hypotheses Generation by Leveraging Semantic Shifts

机译:连接点:通过语义转移生成假设

获取原文

摘要

Literature-based Discovery (LBD) (a.k.a. Hypotheses Generation) is a systematic knowledge discovery process that elicit novel inferences about previously unknown scientific knowledge by rationally connecting complementary and non-interactive literature. Prompt identification of such novel knowledge is beneficial not only for researchers but also for various other stakeholders such as universities, funding bodies and academic publishers. Almost all the prior LBD research suffer from two major limitations. Firstly, the over-reliance of domain-dependent resources which restrict the models' applicability to certain domains/problems. In this regard, we propose a generalisable LBD model that supports both cross-domain and cross-lingual knowledge discovery. The second persistent research deficiency is the mere focus of static snapshot of the corpus (i.e. ignoring the temporal evolution of topics) to detect the new knowledge. However, the knowledge in scientific literature changes dynamically and thus relying merely on static snapshot limits the model's ability in capturing semantically meaningful connections. As a result, we propose a novel temporal model that captures semantic change of topics using diachronic word embeddings to unravel more accurate connections. The model was evaluated using the largest available literature repository to demonstrate the efficiency of the proposed cues towards recommending novel knowledge.
机译:基于文献的发现(LBD)(又称为假设生成)是一种系统的知识发现过程,它通过合理地连接互补性和非交互性文献来引发对先前未知的科学知识的新颖推断。及时识别此类新颖知识不仅对研究人员有益,而且对其他利益相关者(如大学,资助机构和学术出版商)也有利。几乎所有先前的LBD研究都受到两个主要限制。首先,过度依赖领域相关资源限制了模型对某些领域/问题的适用性。在这方面,我们提出了一个可概括的LBD模型,该模型支持跨域和跨语言的知识发现。第二个持续的研究缺陷是仅靠语料库的静态快照(即忽略主题的时间演变)来检测新知识。但是,科学文献中的知识是动态变化的,因此仅依靠静态快照会限制模型捕获语义上有意义的连接的能力。结果,我们提出了一种新颖的时间模型,该模型使用历时性词嵌入捕获主题的语义变化,以阐明更准确的联系。使用最大的可用文献资料库对模型进行了评估,以证明所建议线索对推荐新知识的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号