首页> 外文学位 >Biomedical literature mining with transitive closure and maximum network flow.
【24h】

Biomedical literature mining with transitive closure and maximum network flow.

机译:生物医学文献挖掘,具有传递性封闭和最大的网络流量。

获取原文
获取原文并翻译 | 示例

摘要

The biological literature is a huge and constantly increasing source of information which the biologist may consult for information about their field, but the vast amount of data can sometimes become overwhelming. Medline, which makes a great amount of biological journal data available online, makes the development of automated text mining systems and hence "data-driven discovery" possible. This thesis examines current work in the field of text mining and biological literature, and then aims to mine documents pertaining to bone biology. The documents are retrieved from PubMed, and then direct associations between the terms are computers. Potentially novel transitive associations among biological objects are then discovered using the transitive closure algorithm and the maximum flow algorithm. The thesis discusses in detail the extraction of biological objects from the collected documents and the co-occurrence based text mining algorithm, the transitive closure algorithm, and the maximum network flow which were then run to extract the potentially novel biological associations. Generated hypotheses (novel associations) were assigned with significance scores for further validation by a bone biologist expert. Extension of the work in to hypergraphs for enhanced meaning and accuracy is also examined in the thesis.
机译:生物学文献是一个巨大且不断增长的信息来源,生物学家可以向生物学文献咨询以获取有关其领域的信息,但是有时大量的数据会变得不知所措。 Medline可以在线提供大量生物期刊数据,从而使自动化文本挖掘系统的开发以及“数据驱动的发现”成为可能。本文研究了文本挖掘和生物学文献领域的当前工作,然后旨在挖掘与骨骼生物学有关的文献。从PubMed中检索文档,然后术语之间的直接关联是计算机。然后使用传递封闭算法和最大流量算法发现生物对象之间潜在的新颖传递关联。本文详细讨论了从收集的文档中提取生物对象的方法,以及基于共现的文本挖掘算法,传递闭包算法和最大网络流量,然后运行这些算法来提取潜在的新型生物关联。生成的假设(新颖的关联)分配有显着性评分,以供骨生物学专家进一步验证。本文还研究了将工作扩展到超图上的意义和准确性。

著录项

  • 作者

    Hoblitzell, Andrew P.;

  • 作者单位

    Purdue University.;

  • 授予单位 Purdue University.;
  • 学科 Biology Bioinformatics.;Computer Science.;Information Science.
  • 学位 M.S.
  • 年度 2011
  • 页码 64 p.
  • 总页数 64
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号