Published in: IEEE International Conference on Parallel and Distributed Systems

MatrixMap: Programming Abstraction and Implementation of Matrix Computation for Big Data Applications

Abstract

The computational core of many big data applications can be expressed as general matrix computations, which include both linear algebra operations and irregular matrix operations. However, existing parallel programming systems such as Spark lack a programming abstraction and an efficient implementation for general matrix computations. In this paper, we present MatrixMap, a unified and efficient data-parallel system for general matrix computations. MatrixMap provides a powerful yet simple abstraction consisting of a distributed data structure, called the bulk key matrix, and a computation interface defined by matrix patterns. Users load data into bulk key matrices and express algorithms as parallel matrix patterns. MatrixMap outperforms current state-of-the-art systems by employing three key techniques: matrix patterns with lambda functions that cover both irregular and linear algebra matrix operations, an asynchronous computation pipeline with data shuffling strategies optimized for specific matrix patterns, and in-memory data structures that reuse data across iterations. Moreover, it automatically handles the parallelization and distributed execution of programs on a large cluster. Experimental results show that MatrixMap is 12 times faster than Spark.
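To illustrate how a single matrix pattern parameterized by lambda functions could serve both linear algebra and irregular operations, here is a minimal single-process Python sketch. It is not the MatrixMap API: the function name `multiply_pattern` and its parameters `combine`, `reduce_fn`, and `zero` are hypothetical, plain nested lists stand in for the bulk key matrix, and nothing here is distributed or asynchronous.

```python
from functools import reduce

INF = float("inf")


def multiply_pattern(a, b, combine, reduce_fn, zero):
    """Hypothetical 'multiply' pattern (an assumption, not MatrixMap's API).

    Computes out[i][j] by folding reduce_fn over combine(a[i][k], b[k][j])
    for every k, starting from zero.
    """
    columns = list(zip(*b))  # columns of b, so each pairs up with a row of a
    return [
        [
            reduce(reduce_fn, (combine(x, y) for x, y in zip(row, col)), zero)
            for col in columns
        ]
        for row in a
    ]


# Linear algebra: the ordinary matrix product (multiply, then sum).
product = multiply_pattern(
    [[1, 2], [3, 4]],
    [[5, 6], [7, 8]],
    combine=lambda x, y: x * y,
    reduce_fn=lambda s, t: s + t,
    zero=0,
)

# Irregular operation: one relaxation step of all-pairs shortest paths,
# using the same pattern with (add, then min) over a distance matrix.
dist = [
    [0, 1, INF],
    [INF, 0, 2],
    [INF, INF, 0],
]
two_hop = multiply_pattern(
    dist,
    dist,
    combine=lambda x, y: x + y,
    reduce_fn=min,
    zero=INF,
)
```

Only the lambdas change between the two calls. The traversal over rows and columns is the same in both, so the parallelization and data movement for that traversal could in principle be shared across linear algebra and irregular operations.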