首页> 外文期刊>Swarm and Evolutionary Computation >Semantic genetic programming for fast and accurate data knowledge discovery
【24h】

Semantic genetic programming for fast and accurate data knowledge discovery

机译:语义遗传程序设计,可快速准确地发现数据知识

获取原文
获取原文并翻译 | 示例
           

摘要

Big data knowledge discovery emerged as an important factor contributing to advancements in society at large. Still, researchers continuously seek to advance existing methods and provide novel ones for analysing vast data sets to make sense of the data, extract useful information, and build knowledge to inform decision making. In the last few years, a very promising variant of genetic programming was proposed: geometric semantic genetic programming. Its difference with the standard version of genetic programming consists in the fact that it uses new genetic operators, called geometric semantic operators, that, acting directly on the semantics of the candidate solutions, induce by definition a unimodal error surface on any supervised learning problem, independently from the complexity and size of the underlying data set. This property should improve the evolvability of genetic programming in presence of big data and thus makes geometric semantic genetic programming an extremely promising method for mining vast amounts of data. Nevertheless, to the best of our knowledge, no contribution has appeared so far to employ this new technology to big data problems. This paper intends to fill this gap. For the first time, in fact, we show the effectiveness of geometric semantic genetic programming on several complex real-life problems, characterized by vast amounts of data, coming from several different application domains. (C) 2015 Elsevier B.V. All rights reserved.
机译:大数据知识发现已成为促进整个社会进步的重要因素。尽管如此,研究人员仍在不断寻求改进现有方法,并提供新颖的方法来分析庞大的数据集,以使数据有意义,提取有用的信息并积累知识以为决策提供依据。在最近几年中,提出了一种非常有前途的遗传编程变体:几何语义遗传编程。它与标准编程方案的不同之处在于,它使用了称为几何语义运算符的新遗传运算符,该遗传运算符直接作用于候选解的语义,并根据定义在任何有监督的学习问题上诱导出单峰错误面,与基础数据集的复杂性和大小无关。此属性应提高存在大数据时遗传编程的可进化性,因此使几何语义遗传编程成为挖掘大量数据的极有希望的方法。然而,据我们所知,到目前为止,将这种新技术用于大数据问题尚未做出任何贡献。本文旨在填补这一空白。实际上,这是第一次,我们首次展示了几何语义遗传程序设计对几个复杂的现实问题的有效性,这些问题的特点是来自多个不同应用领域的大量数据。 (C)2015 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号