首页> 外文会议>International Conference on Cloud Computing and Big Data >Synthetic Data Generator for Classification Rules Learning
【24h】

Synthetic Data Generator for Classification Rules Learning

机译:用于分类规则学习的综合数据生成器

获取原文

摘要

A standard data set is useful to empirically evaluate classification rules learning algorithms. However, there is still no standard data set which is common enough for various situations. Data sets from the real world are limited to specific applications. The sizes of attributes, the rules and samples of the real data are fixed. A data generator is proposed here to produce synthetic data set which can be as big as the experiments demand. The size of attributes, rules, and samples of the synthetic data sets can be easily changed to meet the demands of evaluation on different learning algorithms. In the generator, related attributes are created at first. And then, rules are created based on the attributes. Samples are produced following the rules. Three decision tree algorithms are evaluated used synthetic data sets produced by the proposed data generator.
机译:标准数据集可用于根据经验评估分类规则学习算法。但是,仍然没有适用于各种情况的足够通用的标准数据集。来自现实世界的数据集仅限于特定的应用程序。属性的大小,规则和实际数据的样本是固定的。这里建议使用一个数据生成器来生成可以与实验需求一样大的合成数据集。可以轻松更改合成数据集的属性,规则和样本的大小,以满足对不同学习算法进行评估的需求。在生成器中,首先创建相关属性。然后,基于属性创建规则。遵循规则生产样品。评估了三种决策树算法,该算法使用了拟议的数据生成器生成的合成数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号