...
首页> 外文期刊>Swarm and Evolutionary Computation >A distributed evolutionary multivariate discretizer for Big Data processing on Apache Spark
【24h】

A distributed evolutionary multivariate discretizer for Big Data processing on Apache Spark

机译:用于Apache Spark的大数据处理的分布式进化多变量分离器

获取原文
获取原文并翻译 | 示例

摘要

Nowadays the phenomenon of Big Data is overwhelming our capacity to extract relevant knowledge through classical machine learning techniques. Discretization (as part of data reduction) is presented as a real solution to reduce this complexity. However, standard discretizers are not designed to perform well with such amounts of data. This paper proposes a distributed discretization algorithm for Big Data analytics based on evolutionary optimization. After comparing with a distributed discretizer based on the Minimum Description Length Principle, we have found that our solution yields more accurate and simpler solutions in reasonable time.
机译:如今,大数据的现象是通过古典机器学习技术来提取相关知识的能力。 离散化(作为数据减少的一部分)作为一个真实解决方案,以降低这种复杂性。 但是,标准自行设定者并不设计用于使用这种数据进行良好。 本文提出了一种基于进化优化的大数据分析分布式离散化算法。 在与基于最小描述长度原理的分布式分离器比较后,我们发现我们的解决方案在合理的时间内产生更准确和更简单的解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号