首页> 外文会议>2014 Second World Conference on Complex Systems >Vertical query-join benchmark in a cloud database environment
【24h】

Vertical query-join benchmark in a cloud database environment

机译:云数据库环境中的垂直查询联接基准

获取原文
获取原文并翻译 | 示例

摘要

Nowadays, enterprises across all branches and sectors face a new hype regarding “Big Data”. Thus, new requirements in the context of Business Intelligence emerge. Big Data demands to process vast amounts of unstructured data from social networks, sensor data, etc. in near real-time. In order to tackle these challenges, current research works aim to develop new ways of data storage and analysis from a database point of view. This is the advent of so-called “In-Memory” databases (e.g. SAP HANA) that hold entire data volumes in their fast RAM memory and use hard disks only for logging or archiving purposes. Another promising technology with respect to this topic is "Cloud Computing". Storing and analyzing vast amounts of heterogeneous data require appropriate underlying hardware infrastructures. Obtaining such hardware capabilities form external cloud providers is an auspicious way to avoid expensive investments in new hardware. However, using external hardware resources from the public cloud always means that crucial data has to leave the internal enterprise network and enterprises have to trust external providers. Bringing "Big Data" into the cloud, our approach follows the principle of vertically distributed database tables. The main idea is to divide crucial database data and distribute it across different (public and private) cloud providers. Thus, every provider only gets a small part of the data. These individual small parts are worthless without the other parts and enable enterprises to meet their compliance rules concerning data security and protection. So Cloud Computing becomes an interesting alternative to store vast amounts of data. This work evaluates our approach from a performance point of view and presents the corresponding query times with and without vertically partitioned data.
机译:如今,所有分支机构和部门的企业都面临着有关“大数据”的新炒作。因此,在商业智能的背景下出现了新的要求。大数据要求接近实时地处理来自社交网络,传感器数据等的大量非结构化数据。为了应对这些挑战,当前的研究工作旨在从数据库的角度开发数据存储和分析的新方法。这就是所谓的“内存中”数据库(例如SAP HANA)的问世,这些数据库将整个数据卷保存在其快速RAM内存中,并且仅将硬盘用于记录或归档目的。关于此主题的另一种有前途的技术是“云计算”。存储和分析大量异构数据需要适当的基础硬件基础架构。从外部云提供商那里获得这种硬件功能是一种避免对新硬件进行昂贵投资的吉祥方式。但是,使用来自公共云的外部硬件资源始终意味着关键数据必须离开企业内部网络,而企业必须信任外部提供商。将“大数据”引入云中,我们的方法遵循垂直分布的数据库表的原理。主要思想是划分关键的数据库数据,并将其分布在不同的(公共和私有)云提供商中。因此,每个提供者仅获得一小部分数据。这些单独的小部分没有其他部分就毫无价值,并使企业能够满足其有关数据安全性和保护性的合规性规则。因此,云计算成为存储大量数据的有趣替代方法。这项工作从性能的角度评估了我们的方法,并给出了有或没有垂直分区数据的相应查询时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号