首页> 外文会议>2011 Seventh International Conference on Natural Computation >Efficient building algorithms of decision tree for uniformly distributed uncertain data
【24h】

Efficient building algorithms of decision tree for uniformly distributed uncertain data

机译:均匀分布不确定数据的决策树高效构建算法

获取原文

摘要

Developing algorithms for uncertain data is one of the most active themes in data mining community. A number of different decision tree classifiers have been studied in order to deal with uncertain data. This paper extends these works. In this paper, we develop a tree-pruning algorithm using sum of the tuples fractions based on probability theory. By pruning, we find that the accuracy of the classifier is improved and the efficiency of building the decision tree is also improved. Besides, we find that under the context of uniformly distribution, increasing the sampling density of the uncertain attribute value can make little contribution to improve the accuracy, but is computationally more costly. So we propose a new method of sampling. Using this sampling method, the execution time of building the decision tree is greatly decreased.
机译:为不确定数据开发算法是数据挖掘社区中最活跃的主题之一。为了处理不确定的数据,已经研究了许多不同的决策树分类器。本文扩展了这些工作。在本文中,我们基于概率论开发了一种使用元组分数之和的树修剪算法。通过修剪,我们发现分类器的准确性得到了提高,并且决策树的构建效率也得到了提高。此外,我们发现,在均匀分布的情况下,增加不确定属性值的采样密度对提高准确性几乎没有贡献,但在计算上却更加昂贵。因此,我们提出了一种新的抽样方法。使用这种采样方法,大大减少了构建决策树的执行时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号