【24h】

Learning what to read: Focused machine reading

机译:学习阅读内容:专注于机器阅读

获取原文

摘要

Recent efforts in bioinformatics have achieved tremendous progress in the machine reading of biomedical literature, and the assembly of the extracted biochemical interactions into large-scale models such as protein signaling pathways. However, batch machine reading of literature at today's scale (PubMed alone indexes over 1 million papers per year) is unfeasible due to both cost and processing overhead. In this work, we introduce a focused reading approach to guide the machine reading of biomedical literature towards what literature should be read to answer a biomedical query as efficiently as possible. We introduce a family of algorithms for focused reading, including an intuitive, strong baseline, and a second approach which uses a reinforcement learning (RL) framework that learns when to explore (widen the search) or exploit (narrow it) We demonstrate that the RL approach is capable of answering more queries than the baseline, while being more efficient, i.e , reading fewer documents.
机译:在生物信息学的机器阅读以及将提取的生物化学相互作用组装成大规模模型(例如蛋白质信号通路)的过程中,生物信息学的最新努力取得了巨大进展。但是,由于成本和处理开销的原因,当今规模的批量机器阅读文学作品(仅PubMed每年索引超过一百万篇论文)是不可行的。在这项工作中,我们介绍了一种集中阅读方法,以指导对生物医学文献进行机器阅读,以指导应阅读哪些文献以尽可能有效地回答生物医学查询。我们介绍了一系列针对重点阅读的算法,包括直观,强大的基线,以及使用强化学习(RL)框架的第二种方法,该框架学习何时进行探索(扩大搜索范围)或利用(缩小范围)。 RL方法能够回答比基线更多的查询,同时效率更高,即读取更少的文档。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号