首页> 美国卫生研究院文献>other >Automatic Human-like Mining and Constructing Reliable Genetic Association Database with Deep Reinforcement Learning
【2h】

Automatic Human-like Mining and Constructing Reliable Genetic Association Database with Deep Reinforcement Learning

机译:深度强化学习的类人自动挖掘与构建可靠的遗传关联数据库

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The increasing amount of scientific literature in biological and biomedical science research has created a challenge in continuous and reliable curation of the latest knowledge discovered, and automatic biomedical text-mining has been one of the answers to this challenge. In this paper, we aim to further improve the reliability of biomedical text-mining by training the system to directly simulate the human behaviors such as querying the PubMed, selecting articles from queried results, and reading selected articles for knowledge. We take advantage of the efficiency of biomedical text-mining, the flexibility of deep reinforcement learning, and the massive amount of knowledge collected in UMLS into an integrative artificial intelligent reader that can automatically identify the authentic articles and effectively acquire the knowledge conveyed in the articles. We construct a system, whose current primary task is to build the genetic association database between genes and complex traits of human. Our contributions in this paper are three-fold: 1) We propose to improve the reliability of text-mining by building a system that can directly simulate the behavior of a researcher, and we develop corresponding methods, such as Bi-directional LSTM for text mining and Deep Q-Network for organizing behaviors. 2) We demonstrate the effectiveness of our system with an example in constructing a genetic association database. 3) We release our implementation as a generic framework for researchers in the community to conveniently construct other databases.
机译:生物和生物医学科学研究中科学文献数量的不断增加,对持续和可靠地管理发现的最新知识提出了挑战,而自动生物医学文本挖掘已成为应对这一挑战的方法之一。在本文中,我们旨在通过培训系统来直接模拟人类行为,例如查询PubMed,从查询结果中选择文章以及阅读所选文章以获取知识,从而进一步提高生物医学文本挖掘的可靠性。我们利用生物医学文本挖掘的效率,深度强化学习的灵活性以及在UMLS中收集的大量知识,将其集成到一个集成的人工智能阅读器中,该阅读器可以自动识别真实的文章并有效地获取文章中传达的知识。我们构建了一个系统,当前的主要任务是建立人类基因与复杂性状之间的遗传关联数据库。我们在本文中的贡献包括三个方面:1)我们建议通过构建一个可以直接模拟研究人员行为的系统来提高文本挖掘的可靠性,并开发相应的方法,例如用于文本的双向LSTM挖掘和Deep Q-Network来组织行为。 2)我们以构建遗传关联数据库为例,演示了我们系统的有效性。 3)我们将实现作为通用框架发布,以供社区研究人员方便地构建其他数据库。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号