Conference: International Conference on Discovery Science

A Bare Bones Approach to Literature-Based Discovery: An Analysis of the Raynaud’s/Fish-Oil and Migraine-Magnesium Discoveries in Semantic Space

Abstract

Literature-based discovery can be characterized as a goal-directed search for previously unknown, implicit knowledge captured within a collection of scientific articles. Swanson’s serendipitous discovery of dietary fish oil as a treatment for Raynaud’s disease, made while browsing Medline, an online collection of biomedical literature, exemplifies such a discovery. In a series of experiments, the impact of stop words, weighting schemes, discovery mechanisms, and contextual reduction is studied in relation to replicating the Raynaud/fish-oil and migraine-magnesium discoveries by operational means. Two aspects of discovery are brought into focus: (i) the discovery of intermediate terms, or B-terms, and (ii) the discovery of indirect A-C connections via the B-terms. A semantic space representation of the underlying corpus is computed, and discoveries are automated by computing associations between words in both the higher-dimensional and the contextually reduced spaces. The discovery of B-terms and A-C connections can be achieved to an encouraging degree with a standard stop word list. No single weighting scheme seems to suffice: log-likelihood appears potentially effective for discovering B-terms, whereas odds ratio and simple co-occurrence frequencies both facilitate the discovery of A-C connections. With regard to discovery mechanism, both semantic similarity (via cosine) and information flow computation seem promising for computing A-C connections, but more research is needed to understand their relative strengths and weaknesses. Discovery in a contextually reduced semantic space produced mixed results.
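
To make the two discovery steps concrete, the following is a minimal sketch, not the authors' implementation. It assumes a sliding-window co-occurrence space with raw frequencies (the simplest of the weightings the abstract names) and cosine similarity for A-C connections. The function names, the window size, and the five-sentence toy corpus are all invented for illustration.

```python
from collections import Counter, defaultdict
from math import sqrt


def build_space(docs, window=2, stop_words=frozenset()):
    """Map each word to a Counter of co-occurrence counts within `window` words."""
    space = defaultdict(Counter)
    for doc in docs:
        tokens = [t for t in doc.lower().split() if t not in stop_words]
        for i, word in enumerate(tokens):
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    space[word][tokens[j]] += 1
    return space


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as Counters."""
    dot = sum(weight * v[key] for key, weight in u.items() if key in v)
    norm_u = sqrt(sum(w * w for w in u.values()))
    norm_v = sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def b_terms(space, a, top=3):
    """Candidate B-terms: the words most strongly co-occurring with the A-term."""
    return [word for word, _ in space[a].most_common(top)]


def a_c_candidates(space, a, top=3):
    """Rank words that never co-occur with `a` directly by cosine similarity to `a`."""
    a_vec = space[a]
    direct = set(a_vec) | {a}
    scores = {w: cosine(a_vec, vec) for w, vec in space.items() if w not in direct}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)[:top]


# Invented toy corpus: "raynaud" and "fishoil" never appear together,
# but they share the intermediate terms.
docs = [
    "raynaud blood viscosity",
    "raynaud platelet aggregation",
    "fishoil blood viscosity",
    "fishoil platelet aggregation",
    "aspirin headache relief",
]
space = build_space(docs)
print(b_terms(space, "raynaud", top=4))
print(a_c_candidates(space, "raynaud", top=1))
```

On this toy corpus the top-ranked A-C candidate for raynaud is fishoil, because the two words have identical co-occurrence vectors despite never sharing a document. Swapping the raw counts for log-likelihood or odds-ratio weights would change only how the Counter values are computed, not the ranking logic.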
