首页> 外文会议>International Workshop on Formal Methods in Systems Biology >Exploiting Ontology Structure and Patterns of Annotation to Mine Significant Associations between Pairs of Controlled Vocabulary Terms
【24h】

Exploiting Ontology Structure and Patterns of Annotation to Mine Significant Associations between Pairs of Controlled Vocabulary Terms

机译:利用本体结构与注释模式,以挖掘对照词汇术语对之间的重要关联

获取原文

摘要

There is significant knowledge captured through annotations on the life sciences Web. In past research, we developed a methodology of support and confidence metrics from association rule mining, to mine the association bridge (of termlinks) between pairs of controlled vocabulary (CV) terms across two ontologies. Our (naive) approach did not exploit the following: implicit knowledge captured via the hierarchical is-a structure of ontologies, and patterns of annotation in datasets that may impact the distribution of parent/child or sibling CV terms. In this research, we consider this knowledge. We aggregate termlinks over the siblings of a parent CV term and use them as additional evidence to boost support and confidence scores in the associations of the parent CV term. A weight factor (α) reflects the contribution from the child CV terms; its value can be varied to reflect a variance of confidence values among the sibling CV terms of some parent CV term. We illustrate the benefits of exploiting this knowledge through experimental evaluation.
机译:通过生命科学网站的注释捕获了重要知识。在过去的研究中,我们从关联规则挖掘开发了一种支持和置信度量的方法,以在两个本体中的受控词汇(CV)条款对之间的关​​联网桥(术语)挖掘。我们的(天真)方法没有利用以下内容:通过层次结构捕获的隐式知识 - 是一个本体结构的结构,以及数据集的注释模式,可能影响父/儿童或兄弟cv术语的分布。在这项研究中,我们考虑到这一知识。我们将终端链接在父CV项的兄弟阶段上,并使用它们作为额外的证据,以提高母体CV项的关联中的支持和置信度分数。重量因子(α)反映了儿童CV术语的贡献;它的价值可以变化,以反映一些母体CV项的兄弟CV术语之间的置信度值的变化。我们说明了通过实验评估利用这些知识的好处。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号