首页> 外文会议>International Conference on Foundations of Computer Science >Optimization of out-of-core data preparation methods identifying runtime determining factors
【24h】

Optimization of out-of-core data preparation methods identifying runtime determining factors

机译:优化核心数据准备方法识别运行时确定因子

获取原文

摘要

In the data preparation phase of a data mining task the raw, fine granulated data has to be transformed according to analytical aims into a more compact form in order to represent data at a higher abstraction level suitable for machine processing and human understanding as well. Vast datasets require sophisticated, out-of-core methods, which are prepared to handle these datasets using external storages during their execution. In this paper we investigate different pre-processing approaches to overcome the limitation of the size of the main memory from theoretical and practical points of view. We propose possible alternatives for different processing scenarios. Both of the proposed out-of-core algorithms are capable of processing datasets which are by orders of magnitude larger than the main memory; all this is done in a fault-tolerant way and even on an average PC.
机译:在数据准备阶段的数据挖掘任务中,必须根据分析旨在转换为更紧凑的形式,以表示适合机器处理和人类理解的更高抽象水平的数据。庞大的数据集需要复杂的核心方法,这些方法准备使用外部存储在执行期间处理这些数据集。在本文中,我们调查了不同的预处理方法,以克服主要内存大小的限制,从理论和实际的观点来看。我们为不同的处理方案提出了可能的替代方案。建议的核心算法中的两个都能够处理与大于主存储器大的数量级的数据集;所有这些都以容错的方式完成,甚至是平均PC。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号