首页> 外文会议>2010 Fourth International Conference on Research Challenges in Information Science >A novel decomposition algorithm for binary datatables: Encouraging results on discrimination tasks
【24h】

A novel decomposition algorithm for binary datatables: Encouraging results on discrimination tasks

机译:一种新颖的二进制数据表分解算法:鼓励区分任务的结果

获取原文
获取原文并翻译 | 示例

摘要

We present here an algorithm for decomposing any binary datatable into a set of “sufficient itemsets”, i.e. a non-redundant list of itemsets adequate for reconstructing the whole table up to a permutation of the rows. For doing so, we have replaced the “support” threshold criterion of the well-known Apriori algorithm by a “number of liberties”: the liberty count expresses how a (k+1)-level itemset is constrained by its k-level “parents”, till the level when the situation turns frozen. Our algorithm is symmetric: we take into account the absence of items as well as their presence in our itemsets. Conversely, we present a method for reconstituting the original data starting from our exact MIDOVA representation. We illustrate these points with the examples of Breast Cancer and Mushroom datasets from UCI Repository. We validate our approach by deriving a learning classifier approach and applying it to three discrimination problems drawn from the above-mentioned repository.
机译:在这里,我们提出一种算法,用于将任何二进制数据表分解为一组“足够的项集”,即足以重构整个表直至行排列的项集的非冗余列表。为此,我们用“自由数”代替了著名的Apriori算法的“支持”阈值标准:自由数表示(k + 1)级项目集如何受其k级“父母”,直到局势变得僵持为止。我们的算法是对称的:我们考虑了项目中不存在项目以及项目中存在项目的情况。相反,我们提出了一种从我们的精确MIDOVA表示开始重构原始数据的方法。我们以UCI储存库中的乳腺癌和蘑菇数据集为例说明了这些观点。我们通过推导学习分类器方法并将其应用于从上述存储库中得出的三个歧视问题来验证我们的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号