Journal: IEEE Transactions on Knowledge and Data Engineering

A Parallel Matrix-Based Method for Computing Approximations in Incomplete Information Systems

Abstract

As the volume of data grows at an unprecedented rate, large-scale data mining and knowledge discovery present a tremendous challenge. Rough set theory, which has been used successfully to solve problems in pattern recognition, machine learning, and data mining, centers on the idea that a set of distinct objects may be approximated by a lower and an upper bound. To obtain the benefits that rough sets can provide for data mining and related tasks, efficient computation of these approximations is vital. The recently introduced cloud computing model MapReduce has gained a lot of attention from the scientific community for its applicability to large-scale data analysis. In previous research, we proposed a MapReduce-based method for computing approximations in parallel, which can efficiently process complete data but fails in the case of missing (incomplete) data. To address this shortcoming, three different parallel matrix-based methods are introduced to process large-scale, incomplete data. All of them are built on MapReduce and implemented on Twister, a lightweight MapReduce runtime system. The proposed parallel methods are then experimentally shown to be efficient for processing large-scale data.
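To make the idea of lower and upper approximations over incomplete data concrete, the following is a minimal single-machine sketch in Python. It is not the paper's method: the abstract does not say which relation its three methods use, so the sketch assumes the tolerance relation commonly used for incomplete information systems (two objects are tolerant if they agree on every attribute where both values are known). It also does not involve MapReduce or Twister. The names `MISSING`, `tolerant`, `relation_matrix`, and `approximations` are hypothetical and chosen only for illustration.

```python
from typing import List, Optional, Sequence, Set, Tuple

MISSING = None  # assumption: a missing attribute value is encoded as None

Row = Sequence[Optional[str]]


def tolerant(a: Row, b: Row) -> bool:
    """True if a and b agree on every attribute where both values are known."""
    return all(x is MISSING or y is MISSING or x == y for x, y in zip(a, b))


def relation_matrix(table: Sequence[Row]) -> List[List[int]]:
    """Build the n x n 0/1 matrix of the (assumed) tolerance relation."""
    return [[1 if tolerant(a, b) else 0 for b in table] for a in table]


def approximations(table: Sequence[Row], target: Set[int]) -> Tuple[Set[int], Set[int]]:
    """Return (lower, upper) approximations of `target`, given as object indices."""
    m = relation_matrix(table)
    # Characteristic vector of the target set.
    x = [1 if i in target else 0 for i in range(len(table))]
    lower: Set[int] = set()
    upper: Set[int] = set()
    for i, row in enumerate(m):
        # Row-vector product: how many objects tolerant with i lie in the target.
        hits = sum(r * xi for r, xi in zip(row, x))
        if hits > 0:           # tolerance class of i intersects the target
            upper.add(i)
        if hits == sum(row):   # tolerance class of i is contained in the target
            lower.add(i)
    return lower, upper


if __name__ == "__main__":
    # Toy table with two attributes; None marks a missing value.
    table = [
        ("high", "yes"),
        ("high", None),
        ("low", "no"),
        (None, "no"),
    ]
    lower, upper = approximations(table, {0, 1})
    print(sorted(lower), sorted(upper))
```

Each row of the relation matrix is processed independently of the others, which is the kind of structure that lends itself to splitting across parallel workers; how the paper actually partitions the work is not stated in the abstract.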