首页> 外文会议>Proceedings of the 5th international workshop on Bioinformatics >On discovery of maximal confident rules without support pruning in microarray data
【24h】

On discovery of maximal confident rules without support pruning in microarray data

机译:在不支持微阵列数据修剪的情况下发现最大置信规则

获取原文
获取原文并翻译 | 示例

摘要

Microarray data provides a perfect riposte to the original assumption underlying association rule mining -- large but sparse transaction sets. In a typical microarray the number of columns (genes) is an order of magnitude larger than the number of rows (experiments). A new family of row enumerated rule mining algorithms have emerged to facilitate mining in dense sets. However, to date, all the algorithms proposed to mine expression relationships alone rely on the support measure to prune the search space. This is a major shortcoming as it results in the pruning of many potentially interesting rules which have low support but high confidence. In this paper we propose the MAXCONF algorithm which exploits the weak downward closure of confidence to directly mine for high confidence rules. We also provide a means to evaluate the biological significance of the gene relationships identified. An evaluation of MAXCONF with RERII on the database BIND shows that their recall is 94% and .15% respectively.
机译:微阵列数据为关联规则挖掘基础的原始假设(大型但稀疏的交易集)提供了完美的解决方法。在典型的微阵列中,列(基因)的数量比行(实验)的数量大一个数量级。一个新的行枚举规则挖掘算法家族已经出现,以促进密集集中的挖掘。但是,迄今为止,所有提出的仅挖掘表达关系的算法都依赖于支持措施来修剪搜索空间。这是一个主要缺点,因为它会导致许多潜在的有趣规则被删除,这些规则的支持率较低,但可信度很高。在本文中,我们提出了MAXCONF算法,该算法利用置信度的弱向下闭合直接挖掘高置信度规则。我们还提供了一种方法来评估已确定的基因关系的生物学意义。在数据库BIND上对带有RERII的MAXCONF的评估表明,它们的召回率分别为94%和.15%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号