首页> 外文会议>IEEE International Conference on Data Engineering >Scalable Linear Algebra on a Relational Database System
【24h】

Scalable Linear Algebra on a Relational Database System

机译:关系数据库系统上的可伸缩线性代数

获取原文

摘要

As data analytics has become an important application for modern data management systems, a new category of data management system has appeared recently: the scalable linear algebra system. In this paper, we argue that a parallel or distributed database system is actually an excellent platform upon which to build such functionality. Most relational systems already have support for cost-based optimization—which is vital to scaling linear algebra computations—and it is well-known how to make relational systems scale. We show that by making just a few changes to a parallel/ distributed relational database system, such a system can be a competitive platform for scalable linear algebra. Taken together, our results should at least raise the possibility that brand new systems designed from the ground up to support scalable linear algebra are not absolutely necessary, and that such systems could instead be built on top of existing relational technology. Our results also suggest that if scalable linear algebra is to be added to a modern dataflow platform such as Spark, they should be added on top of the system's more structured (relational) data abstractions, rather than being constructed directly on top of the system's raw dataflow operators.
机译:随着数据分析已成为现代数据管理系统的重要应用程序,最近出现了一种新型的数据管理系统:可扩展的线性代数系统。在本文中,我们认为并行或分布式数据库系统实际上是构建此类功能的出色平台。大多数关系系统已经支持基于成本的优化(这对扩展线性代数计算至关重要),如何使关系系统按比例缩放是众所周知的。我们表明,通过对并行/分布式关系数据库系统进行一些更改,这样的系统可以成为可伸缩线性代数的竞争平台。综上所述,我们的结果至少应该提出这样一种可能性:并不是绝对必要全新设计的系统来支持可扩展的线性代数,而是可以在现有的关系技术之上构建这样的系统。我们的结果还表明,如果要将可伸缩线性代数添加到现代数据流平台(例如Spark),则应将其添加到系统的结构化(关系)数据抽象之上,而不是直接在系统原始数据之上构建数据流运算符。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号