Indexing Multi-dimensional Data in a Cloud System

机译：在云系统中索引多维数据

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Providing scalable database services is an essential requirement for extending many existing applications of the Cloud platform. Due to the diversity of applications, database services on the Cloud must support large-scale data analytical jobs and high concurrent OLTP queries. Most existing work focuses on some specific type of applications. To provide an integrated framework, we are designing a new system, epiC, as our solution to next-generation database systems. In epiC, indexes play an important role in improving overall performance. Different types of indexes are built to provide efficient query processing for different applications.In this paper, we propose RT-CAN, a multi-dimensional indexing scheme in epiC. RT-CAN integrates CAN [23]-based routing protocol and the R-tree based indexing scheme to support efficient multi-dimensional query processing in a Cloud system. RT-CAN organizes storage and compute nodes into an overlay structure based on an extended CAN protocol. In our proposal, we make a simple assumption that each compute node uses an R-tree like indexing structure to index the data that are locally stored. We propose a query-conscious cost model that selects beneficial local R-tree nodes for publishing. By keeping the number of persistently connected nodes small and maintaining a global multi-dimensional search index, we can locate the compute nodes that may contain the answer with a few hops, making the scheme scalable in terms of data volume and number of compute nodes. Experiments on Amazon's EC2 show that our proposed routing protocol and indexing scheme are robust, efficient and scalable.

机译：提供可伸缩的数据库服务是扩展Cloud Platform的许多现有应用程序的基本要求。由于应用程序的多样性，云上的数据库服务必须支持大规模数据分析作业和高并发OLTP查询。现有的大多数工作都集中在某些特定类型的应用程序上。为了提供一个集成的框架，我们正在设计一个新的系统epiC，作为我们对下一代数据库系统的解决方案。在epiC中，索引在提高整体性能方面起着重要作用。构建不同类型的索引可为不同的应用程序提供有效的查询处理。在本文中，我们提出了RT-CAN，这是epiC中的多维索引方案。 RT-CAN集成了基于CAN [23]的路由协议和基于R-tree的索引方案，以支持Cloud系统中高效的多维查询处理。 RT-CAN基于扩展的CAN协议将存储和计算节点组织为覆盖结构。在我们的建议中，我们做出一个简单的假设，即每个计算节点都使用类似R树的索引结构来索引本地存储的数据。我们提出一个查询意识的成本模型，该模型选择有利的本地R-tree节点进行发布。通过使持久连接的节点数保持较小并保持全局多维搜索索引，我们可以通过几跳来定位可能包含答案的计算节点，从而使该方案在数据量和计算节点数方面具有可扩展性。在Amazon EC2上进行的实验表明，我们提出的路由协议和索引方案是可靠，高效和可扩展的。

著录项

来源
《ACM SIGMOD international conference on management of data;SIGMOD 2010》|2010年|P.591-602|共12页
会议地点
作者
Jinbao Wang; Sai Wu; Hong Gao; Jianzhong Li; Beng Chin Ooi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;
关键词
cloud; index; query processing;

机译：云;指数;查询处理;

相似文献

外文文献
中文文献
专利

1. A PR-quadtree based multi-dimensional indexing for complex query in a cloud system [J] . Jian-feng Li, Shi-ping Chen, Lin-mao Duan, Cluster computing . 2017,第4期

机译：基于PR-Quadtree基于云系统复杂查询的多维索引
2. A New Multi-Dimensional Hyperbolic Structure for Cloud Service Indexing [J] . Telesphore Tiendrebeogo, Oumarou Sié International Journal of Database Management Systems . 2016,第2期

机译：用于云服务索引的新的多维双曲结构
3. Accelerate Data Retrieval by Multi-Dimensional Indexing in Switch-Centric Data Centers [J] . Xinjian Luo, Xiaofeng Gao, Guihai Chen, The Computer Journal . 2019,第2期

机译：通过以交换为中心的数据中心中的多维索引来加速数据检索
4. Indexing Multi-dimensional Data in a Cloud System [C] . ACM SIGMOD international conference on management of data . 2010

机译：索引云系统中的多维数据
5. Multi-dimensional indexing for XML data. [D] . Kim, Do Youn. 2005

机译：XML数据的多维索引。
6. Controlled Vocabularies Indexing and Medical Language Processing. Expert Indexing Systems: Research on Interactive Knowledge-Based Indexing: The MedIndEx Prototype [O] . Susanne M. Humphrey 1989

机译：受控词汇表索引编制和医学语言处理。专家索引系统：基于交互式知识的索引的研究：MedIndEx原型
7. Cognitive visual analytics of multi-dimensional cloud system monitoring data [O] . Baciu G, Wang YZ, Li CH 2016

机译：多维云系统监控数据的认知视觉分析

Indexing Multi-dimensional Data in a Cloud System

摘要

著录项

相似文献

相关主题

期刊订阅