首页> 外文学位 >View learning: A statistical relational approach to mining biomedical databases.
【24h】

View learning: A statistical relational approach to mining biomedical databases.

机译:视图学习:一种用于挖掘生物医学数据库的统计关系方法。

获取原文
获取原文并翻译 | 示例

摘要

This dissertation develops and evaluates statistical relational learning (SRL) algorithms that can automatically alter the schema of a database by learning new field and table definitions. The algorithmic advances made in this dissertation are motivated by important problems such as providing decision support system for radiologists who read mammograms, predicting three-dimensional Quantitative Structure-Activity Relationships for drug design, and performing entity resolution---the task of recognizing when two molecules, patients, biological pathways, etc. are actually the same.;This dissertation introduces view learning, the ability to automatically alter the schema of a database through the addition of new fields or tables, for SRL, and presents two algorithms for augmenting the schema of a database by adding new fields. It then extends view learning by developing an algorithm for performing predicate invention. We demonstrate the utility of view learning for SRL in two ways. First, it learns significantly more accurate models on a wide variety of domains. Second, it uncovers important and useful knowledge in these domains. For example, it identified a novel feature from a mammography report that is indicative of malignancy.;Motivated by the preceding work, the last part of the dissertation investigates the relationship between receiver operator characteristic (ROC) space and precision-recall (PR) space. Among other contributions, this part proves that for a fixed number of positive and negative examples, one curve dominates another curve in ROC space if and only if the first curve dominates the second curve in PR space. This result implies the existence of an analog to the convex hull for PR space, which we call the achievable PR curve, and it provides an efficient algorithm for constructing the achievable curve.
机译:本文开发并评估了统计关系学习算法,该算法可以通过学习新的字段和表定义来自动更改数据库的模式。本论文在算法上的进展主要是受到一些重要问题的推动,例如为放射线检查人员提供决策支持系统,以读取乳腺X线照片,预测三维定量构效关系以进行药物设计,以及执行实体解析-识别何时两个分子,患者,生物学途径等实际上是相同的。本论文介绍了视图学习,通过为SRL添加新字段或表来自动更改数据库模式的功能,并提出了两种算法来增强通过添加新字段的数据库架构。然后,它通过开发执行谓词发明的算法来扩展视图学习。我们以两种方式演示了用于SRL的视图学习的实用性。首先,它可以在多种领域中学习更为精确的模型。其次,它揭示了这些领域中重要且有用的知识。例如,它从乳腺X线摄影报告中发现了一个指示恶性肿瘤的新特征。在前面的工作的推动下,论文的最后一部分研究了接收者操作员特征(ROC)空间与精确召回(PR)空间之间的关系。 。除其他贡献外,该部分证明,对于固定数量的正例和负例,当且仅当第一条曲线在PR空间中的第二条曲线上占优势时,一条曲线才在ROC空间中的另一条曲线上占优势。这个结果意味着存在一个PR空间凸包的类似物,我们称它为可实现的PR曲线,它为构建可实现的曲线提供了一种有效的算法。

著录项

  • 作者

    Davis, Jesse Jon.;

  • 作者单位

    The University of Wisconsin - Madison.;

  • 授予单位 The University of Wisconsin - Madison.;
  • 学科 Artificial Intelligence.;Computer Science.
  • 学位 Ph.D.
  • 年度 2007
  • 页码 147 p.
  • 总页数 147
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号