首页> 外文会议>International Conference on Big Data, Small Data, Linked Data and Open Data >A Comparison of Classification Systems for Rule Sets Induced from Incomplete Data by Probabilistic Approximations
【24h】

A Comparison of Classification Systems for Rule Sets Induced from Incomplete Data by Probabilistic Approximations

机译:概率近似从不完全数据引起的规则集的分类系统比较

获取原文

摘要

In this paper, we compare four strategies used in classification systems. A classification system applies a rule set, induced from the training data set in order to classify each testing case as a member of one of the concepts. We assume that both training and testing data sets are incomplete, i.e., some attribute values are missing. In this paper, we discuss two interpretations of missing attribute values: lost values and "do not care" conditions. In our experiments rule sets were induced using probabilistic approximations. Our main results are that for lost value data sets the strength only strategy is better than conditional probability without support and that for "do not care" data sets the conditional probability with support strategy is better than strength only.
机译:在本文中,我们比较分类系统中使用的四种策略。分类系统应用从训练数据集引起的规则集,以便将每个测试用例分类为其中一个概念的成员。我们假设培训和测试数据集都是不完整的,即,缺少某些属性值。在本文中,我们讨论了两个缺少属性值的解释:损失值和“不关心”条件。在我们的实验中,使用概率近似引起规则集。我们的主要结果是,对于损失的价值数据集,强度唯一的策略优于条件概率而不支持,因为“不关心”数据设置有条件概率,支持策略仅优于强度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号