首页> 外文会议>Data Integration in the Life Sciences >Exploiting Ontology Structure and Patterns ofAnnotation to Mine Significant Associations between Pairs of Controlled Vocabulary Terms
【24h】

Exploiting Ontology Structure and Patterns ofAnnotation to Mine Significant Associations between Pairs of Controlled Vocabulary Terms

机译:利用本体结构和注释模式对成对的受控词汇词之间的重要关联

获取原文
获取原文并翻译 | 示例

摘要

There is significant knowledge captured through annotations on the life sciences Web. In past research, we developed a methodology of support and confidence metrics from association rule mining, to mine the association bridge (of termlinks) between pairs of controlled vocabulary (CV) terms across two ontologies. Our (naive) approach did not exploit the following: implicit knowledge captured via the hierarchical is-a structure of ontologies, and patterns of annotation in datasets that may impact the distribution of parent/child or sibling CV terms. In this research, we consider this knowledge. We aggregate termlinks over the siblings of a parent CV term and use them as additional evidence to boost support and confidence scores in the associations of the parent CV term. A weight factor (α) reflects the contribution from the child CV terms; its value can be varied to reflect a variance of confidence values among the sibling CV terms of some parent CV term. We illustrate the benefits of exploiting this knowledge through experimental evaluation.
机译:通过生命科学Web上的注释可以捕获大量知识。在过去的研究中,我们从关联规则挖掘中开发了一种支持和置信度度量的方法,以挖掘跨两个本体的成对受控词汇(CV)词对之间的(术语链接)关联桥梁。我们的(幼稚)方法没有利用以下内容:通过分层is-本体结构捕获的隐性知识,以及可能影响父/子或同级CV术语分布的数据集中的注释模式。在这项研究中,我们考虑了这一知识。我们汇总上一级CV术语的兄弟姐妹的术语链接,并将其用作增加在上一级CV术语的关联中的支持度和置信度得分的附加证据。权重因子(α)反映了子CV项的贡献;它的值可以变化以反映某些父CV术语的同级CV术语之间的置信度值的变化。我们通过实验评估说明了利用这些知识的好处。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号