FatMan vs. LittleBoy: Scaling Up Linear Algebraic Operations in Scale-Out Data Platforms

Abstract

Linear algebraic operations such as matrix manipulations form the kernel of many machine learning and other crucial algorithms. Scaling such algorithms up as well as out is highly desirable to enable efficient processing over millions of data points. To this end, we present a matrix manipulation approach that effectively scales up each node in a scale-out data-parallel platform such as Apache Spark. Specifically, we enable hardware acceleration for matrix multiplications in a distributed Spark setup without user intervention. Our approach supports both dense and sparse distributed matrices, and provides flexible control of acceleration based on matrix density. We demonstrate the benefit of our approach for generalized matrix multiplication over large matrices with up to four billion elements. To connect the effectiveness of our approach to machine learning applications, we perform Gramian matrix computation via generalized matrix multiplications. Our experiments show that our approach achieves more than a 2× speedup, and up to a 96.1% computation improvement, over state-of-the-art Spark MLlib for dense matrices.
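The Gramian computation the abstract benchmarks can be expressed directly in stock Spark MLlib. The sketch below is a minimal illustration, not the paper's accelerated implementation: it computes G = AᵀA as a generalized block-matrix multiplication in Scala, where the input entries and block sizes are placeholder values chosen for brevity.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.mllib.linalg.distributed.{CoordinateMatrix, MatrixEntry}

object GramianViaGemm {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder.appName("GramianViaGemm").getOrCreate()
    val sc = spark.sparkContext

    // Placeholder input: sparse entries of a small 4 x 3 matrix A.
    val entries = sc.parallelize(Seq(
      MatrixEntry(0, 0, 1.0), MatrixEntry(1, 1, 2.0),
      MatrixEntry(2, 2, 3.0), MatrixEntry(3, 0, 4.0)
    ))
    val A = new CoordinateMatrix(entries, 4, 3).toBlockMatrix(2, 2).cache()

    // Gramian G = A^T * A as a generalized block-matrix multiplication.
    // Stock MLlib executes this multiply on the CPU; the paper's approach
    // would transparently accelerate it per node, gated by matrix density.
    val G = A.transpose.multiply(A)
    println(G.toLocalMatrix())

    spark.stop()
  }
}
```

MLlib also provides a dedicated routine, RowMatrix.computeGramianMatrix(); expressing the Gramian as an explicit block-matrix multiply instead is what makes it possible to swap in a different GEMM backend on each node without changing application code.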
