首页> 外文会议>International conference on Asian-Pacific digital libraries >Measuring the Semantic World - How to Map Meaning to High-Dimensional Entity Clusters in PubMed?
【24h】

Measuring the Semantic World - How to Map Meaning to High-Dimensional Entity Clusters in PubMed?

机译:测量语义世界-如何在PubMed中将意义映射到高维实体簇?

获取原文

摘要

The exponential increase of scientific publications in the medical field urgently calls for innovative access paths beyond the limits of a term-based search. As an example, the search term "diabetes" leads to a result of over 600,000 publications in the medical digital library PubMed. In such cases, the automatic extraction of semantic relations between important entities like active substances, diseases, and genes can help to reveal entity-relationships and thus allow simplified access to the knowledge embedded in digital libraries. On the other hand, for semantic-relation tasks distributional embedding models based on neural networks promise considerable progress in terms of accuracy, performance and scalability. Yet, despite the recent successes of neural networks in this field, questions arise related to their non-deterministic nature: Are the semantic relations meaningful, and perhaps even new and unknown entity-relationships? In this paper, we address this question by measuring the associations between important pharmaceutical entities such as active substances (drugs) and diseases in high-dimensional embedded space. In our investigation, we show that while on one hand only few of the contextualized associations directly correlate with spatial distance, on the other hand we have discovered their potential for predicting new associations, which makes the method suitable as a new, literature-based technique for important practical tasks like e.g., drug repurposing.
机译:在医学领域,科学出版物的呈指数增长迫切要求超越基于术语的搜索范围的创新访问途径。例如,搜索词“糖尿病”导致医学数字图书馆PubMed中超过600,000种出版物的结果。在这种情况下,自动提取重要实体(例如活性物质,疾病和基因)之间的语义关系可以帮助揭示实体关系,从而简化对数字图书馆中嵌入的知识的访问。另一方面,对于语义关系任务,基于神经网络的分布嵌入模型有望在准确性,性能和可伸缩性方面取得长足进步。然而,尽管神经网络在该领域最近取得了成功,但仍存在与它们的不确定性有关的问题:语义关系是否有意义,甚至可能是新的和未知的实体关系?在本文中,我们通过测量重要药物实体(例如活性物质(药物))与高维嵌入空间中的疾病之间的关联来解决此问题。在我们的调查中,我们发现,一方面,只有少数上下文关联与空间距离​​直接相关,另一方面,我们发现了它们预测新关联的潜力,这使该方法适合作为一种基于文献的新技术用于重要的实际任务,例如重新利用毒品。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号