首页> 外文会议>International Conference on Data Engineering >Towards a Holistic Integration of Spreadsheets with Databases: A Scalable Storage Engine for Presentational Data Management
【24h】

Towards a Holistic Integration of Spreadsheets with Databases: A Scalable Storage Engine for Presentational Data Management

机译:朝着带有数据库的电子表格的整体集成:可伸缩存储引擎,用于呈现数据管理

获取原文

摘要

Spreadsheet software is the tool of choice for interactive ad-hoc data management, with adoption by billions of users. However, spreadsheets are not scalable, unlike database systems. On the other hand, database systems, while highly scalable, do not support interactivity as a first-class primitive. We are developing DATASPREAD, to holistically integrate spreadsheets as a front-end interface with databases as a back-end datastore, providing scalability to spreadsheets, and interactivity to databases, an integration we term presentational data management (PDM). In this paper, we make the first step towards this vision for relational databases: developing a storage engine for PDM, studying how to flexibly represent spreadsheet data within a relational database and how to support and maintain access by position. We first conduct an extensive survey of spreadsheet use to motivate our functional requirements for a storage engine for PDM. We develop a natural set of mechanisms for flexibly representing spreadsheet data and demonstrate that identifying the optimal representation is NP-HARD; however, we develop an efficient approach to identify the optimal representation from an important and intuitive subclass of representations. We extend our mechanisms with positional access mechanisms that don't suffer from cascading update issues, leading to constant time access and modification performance. We evaluate these representations on a workload of typical spreadsheets and spreadsheet operations, providing up to 50% reduction in storage, and up to 50% reduction in formula evaluation time.
机译:电子表格软件是交互式Ad-hoc数据管理的首选工具,通过数十亿用户采用。但是,与数据库系统不同,电子表格不可扩展。另一方面,数据库系统,同时高度可扩展,不支持作为一流基元的交互性。我们正在开发DataSpRead,将电子表格集成为与数据库作为后端数据库的前端接口,为电子表格提供可扩展性,以及数据库的交互性,是我们术语表示数据管理(PDM)的集成性。在本文中,我们对关系数据库进行了第一步:开发用于PDM的存储引擎,研究如何灵活地代表关系数据库中的电子表格数据以及如何按位置支持和维护访问。我们首先对电子表格进行了广泛的调查,以激励我们对PDM存储引擎的功能要求。我们开发了一种自然的机制,用于灵活地代表电子表格数据,并证明识别最佳表示是NP-HARD;但是,我们开发了一种有效的方法,以确定来自陈述的重要和直观的子类的最佳表示。我们将我们的机制扩展了使用级联更新问题的位置访问机制,导致恒定的时间访问和修改性能。我们在典型电子表格和电子表格操作的工作量上评估这些陈述,可降低储存量高达50%,降低公式评估时间的50%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号