Conference: IEEE International Congress on Big Data

On the Usage of the Probability Integral Transform to Reduce the Complexity of Multi-Way Fuzzy Decision Trees in Big Data Classification Problems

Abstract

We present a new distributed fuzzy partitioning method to reduce the complexity of multi-way fuzzy decision trees in Big Data classification problems. The proposed algorithm builds a fixed number of fuzzy sets for all variables and adjusts their shape and position to the real distribution of the training data. A two-step process is applied: 1) the original distribution is transformed into a standard uniform distribution by means of the probability integral transform (since the original distribution is generally unknown, the cumulative distribution function is approximated by computing the q-quantiles of the training set); 2) a Ruspini strong fuzzy partition is constructed in the transformed attribute space using a fixed number of equally distributed triangular membership functions. Despite this transformation, the definition of every fuzzy set in the original space can be recovered by applying the inverse cumulative distribution function (also known as the quantile function). The experimental results reveal that the proposed methodology allows the state-of-the-art multi-way fuzzy decision tree (FMDT) induction algorithm to maintain classification accuracy with up to 6 million fewer leaves.
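The two steps described in the abstract can be illustrated with a small single-machine sketch. It is not the authors' distributed implementation: the function names (`fit_quantiles`, `approx_cdf`, `approx_quantile`, `memberships`) are hypothetical, and the linear interpolation between quantile cut points is an assumption made here to obtain an invertible estimate of the cumulative distribution function.

```python
from bisect import bisect_right


def fit_quantiles(values, q):
    """Return q + 1 cut points of the sample: min, the q - 1 inner q-quantiles, max."""
    xs = sorted(values)
    n = len(xs)
    cuts = []
    for k in range(q + 1):
        pos = k * (n - 1) / q            # fractional index into the sorted sample
        lo = int(pos)
        hi = min(lo + 1, n - 1)
        cuts.append(xs[lo] + (pos - lo) * (xs[hi] - xs[lo]))
    return cuts


def approx_cdf(x, cuts):
    """Step 1: piecewise-linear CDF estimate, mapping x to u in [0, 1]."""
    q = len(cuts) - 1
    if x <= cuts[0]:
        return 0.0
    if x >= cuts[-1]:
        return 1.0
    k = bisect_right(cuts, x) - 1        # cuts[k] <= x < cuts[k + 1]
    frac = (x - cuts[k]) / (cuts[k + 1] - cuts[k])
    return (k + frac) / q


def approx_quantile(u, cuts):
    """Inverse of approx_cdf: map u in [0, 1] back to the original space."""
    q = len(cuts) - 1
    pos = min(max(u, 0.0), 1.0) * q
    k = min(int(pos), q - 1)
    return cuts[k] + (pos - k) * (cuts[k + 1] - cuts[k])


def memberships(u, n_sets):
    """Step 2: degrees of u in n_sets evenly spaced triangular sets on [0, 1]."""
    step = 1.0 / (n_sets - 1)
    return [max(0.0, 1.0 - abs(u - i * step) / step) for i in range(n_sets)]


# Illustrative skewed attribute (made-up data, not from the paper).
data = [i ** 2 for i in range(101)]
cuts = fit_quantiles(data, q=10)

n_sets = 3
# Cores of the fuzzy sets, recovered in the original attribute space.
cores = [approx_quantile(i / (n_sets - 1), cuts) for i in range(n_sets)]
print(cores)

# Membership degrees of one raw value after the transform.
print(memberships(approx_cdf(900, cuts), n_sets))
```

Because the triangles are evenly spaced in the transformed space, the membership degrees of any value sum to 1 (the Ruspini condition), while the recovered cores follow the quantiles of the data in the original space rather than a uniform grid.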
