The DBMS - your big data sommelier

机译：DBMS - 您的大数据索莫尔

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

When addressing the problem of “big” data volume, preparation costs are one of the key challenges: the high costs for loading, aggregating and indexing data leads to a long data-to-insight time. In addition to being a nuisance to the end-user, this latency prevents real-time analytics on “big” data. Fortunately, data often comes in semantic chunks such as files that contain data items that share some characteristics such as acquisition time or location. A data management system that exploits this trait can significantly lower the data preparation costs and the associated data-to-insight time by only investing in the preparation of the relevant chunks. In this paper, we develop such a system as an extension of an existing relational DBMS (MonetDB). To this end, we develop a query processing paradigm and data storage model that are partial-loading aware. The result is a system that can make a 1.2 TB dataset (consisting of 4000 chunks) ready for querying in less than 3 minutes on a single server-class machine while maintaining good query processing performance.

机译：在解决“大”数据量的问题时，准备成本是关键挑战之一：加载，聚合和索引数据的高成本导致长数据到洞察时间。除了对最终用户的滋扰之外，此延迟还可以防止“大”数据上的实时分析。幸运的是，数据通常来自语义块，例如包含共享一些特征的数据项，例如获取时间或位置。利用这种特征的数据管理系统可以通过仅在编写相关块的准备情况下显着降低数据准备成本和相关数据到洞察时间。在本文中，我们开发这样的系统作为现有关系DBMS的扩展（MONETDB）。为此，我们开发了一个查询处理范例和数据存储模型，它是部分加载感知的。结果是一个系统，可以制作1.2 TB数据集（由4000个块组成），准备在单个服务器类计算机上不到3分钟内查询，同时保持良好的查询处理性能。

著录项

来源
《IEEE international conference on data engineering》|2015年||共12页
会议地点
作者
Kargin Yagiz; Kersten Martin; Manegold Stefan; Pirk Holger;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算机软件;
关键词

相似文献

外文文献
中文文献
专利

1. Data in the time of COVID-19: a general methodology to select and secure a NoSQL DBMS for medical data [J] . Kamal A. ElDahshan, AbdAllah A. AlHabshy, Gaber E. Abutaleb PeerJ Computer Science . 2020,第1期

机译：Covid-19中的数据：用于为医疗数据选择和保护NoSQL DBMS的一般方法
2. Necessity to Design of New DBMS Platforms for Data Analysis in Market-oriented Cloud Computing: Properties and Limitations of Data Analysis [J] . Liladhar R. Rewatkar, Ujwal A. Lanjewar Journal of Computational Intelligence in Bioinformatics . 2016,第2期

机译：设计面向市场的云计算中用于数据分析的新DBMS平台的必要性：数据分析的属性和局限性
3. Using Data Compression for Increasing Efficiency of Data Transfer Between Main Memory and Intel Xeon Phi Coprocessor or NVidia GPU in Parallel DBMS [J] . Konstantin Y. Besedin, Pavel S. Kostenetskiy, Stepan O. Prikazchikov Procedia Computer Science . 2015,第1期

机译：使用数据压缩来提高并行DBMS中主内存与Intel Xeon Phi协处理器或NVidia GPU之间的数据传输效率
4. The DBMS - your big data sommelier [C] . Kargin Yagiz, Kersten Martin, Manegold Stefan, IEEE international conference on data engineering . 2015

机译：DBMS-您的大数据侍酒师
5. Novel Selectivity Estimation Strategy for Modern DBMS [D] . Shin, Jun Hyung 2018

机译：现代DBMS的新型选择性估计策略
6. Evaluation of bone healing in canine tibial defects filled with cortical autograft commercial-DBM calf fetal DBM omentum and omentum-calf fetal DBM [O] . Amin Bigham-Sadegh, Iraj Karimi, Mahsa Alebouye, 2013

机译：评估自体皮质移植物市售DBM小牛胎儿DBM大网膜和大网膜小牛胎儿DBM填充的胫胫骨缺损的骨愈合情况
7. The DBMS - your Big Data Sommelier [O] . Kargın, Y., Kersten, M., Manegold, S., 2015

机译：DBMS-您的大数据侍酒师
8. Develop an Automated Data Base Management System (DBMS): Report on DBMS Software and User's Guide: Final Report, Task 2 [R] . 1987

机译：开发自动数据库管理系统（DBms）：DBms软件和用户指南报告：最终报告，任务2

The DBMS - your big data sommelier

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅