首页> 外文会议>Data Engineering, ICDE, 2009 IEEE 25th International Conference on >A Rule-Based Classification Algorithm for Uncertain Data
【24h】

A Rule-Based Classification Algorithm for Uncertain Data

机译:基于规则的不确定数据分类算法

获取原文

摘要

Data uncertainty is common in real-world applications due to various causes, including imprecise measurement, network latency, outdated sources and sampling errors. These kinds of uncertainty have to be handled cautiously, or else the mining results could be unreliable or even wrong. In this paper, we propose a new rule-based classification and prediction algorithm called uRule for classifying uncertain data. This algorithm introduces new measures for generating, pruning and optimizing rules. These new measures are computed considering uncertain data interval and probability distribution function. Based on the new measures, the optimal splitting attribute and splitting value can be identified and used for classification and prediction. The proposed uRule algorithm can process uncertainty in both numerical and categorical data. Our experimental results show that uRule has excellent performance even when data is highly uncertain.
机译:由于各种原因,包括不确定性的测量,网络延迟,过时的源和采样错误,数据不确定性在现实世界的应用程序中很常见。必须谨慎处理这类不确定性,否则采矿结果可能不可靠甚至是错误的。在本文中,我们提出了一种新的基于规则的分类和预测算法,称为uRule,用于对不确定数据进行分类。该算法引入了用于生成,修剪和优化规则的新措施。考虑到不确定的数据间隔和概率分布函数来计算这些新的度量。基于新措施,可以识别最佳分割属性和分割值,并将其用于分类和预测。提出的uRule算法可以处理数值和分类数据中的不确定性。我们的实验结果表明,即使在数据高度不确定的情况下,uRule也具有出色的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号