首页> 外文会议>IEEE/ACM international symposium on cluster, cloud and grid computing >Discovering and Leveraging Content Similarity to Optimize Collective on-Demand Data Access to IaaS Cloud Storage
【24h】

Discovering and Leveraging Content Similarity to Optimize Collective on-Demand Data Access to IaaS Cloud Storage

机译:发现和利用内容相似性来优化对IaaS云存储的按需集体数据访问

获取原文

摘要

A critical feature of IaaS cloud computing is the ability to quickly disseminate the content of a shared dataset at large scale. In this context, a common pattern is collective on-demand read, i.e., accessing the same VM image or dataset from a large number of V Minstances concurrently. There are various techniques that avoid I/Ocontention to the storage service where the dataset is located without relying on pre-broadcast. Most such techniques employ peer-to-peer collaborative behavior where the VM instances exchange information about the content that was accessed during runtime, such that it impossible to fetch the missing data pieces directly from each other rather than the storage system. However, such techniques are often limited within a group that performs a collective read. In light of high data redundancy on large IaaS data centers and multiple users that simultaneously run VM instance groups that perform collective reads, an important opportunity arises: enabling unrelated VMinstances belonging to different groups to collaborate and exchange common data in order to further reduce the I/O pressure on the storage system. This paper deals with the challenges posed by such absolution, which prompt the need for novel techniques to efficiently detect and leverage common data pieces across groups. To this end, we introduce a low-overhead fingerprint based approach that we evaluate and demonstrate to be efficient in practice for a representative scenario on dozens of nodes and a variety of group configurations.
机译:IaaS云计算的一项关键功能是能够快速大规模传播共享数据集的内容。在这种情况下,共同的模式是集体按需读取,即从大量V实例同时访问同一VM映像或数据集。有多种技术可以避免对数据集所在的存储服务的I / O争用,而无需依赖于预广播。大多数此类技术采用点对点协作行为,其中VM实例交换有关在运行时访问的内容的信息,这样就不可能直接从彼此而不是从存储系统中获取丢失的数据。但是,这样的技术通常限于执行集体读取的组内。鉴于大型IaaS数据中心和同时运行执行集体读取的VM实例组的多个用户的高数据冗余性,出现了一个重要的机会:使属于不同组的不相关的VM实例能够协作并交换通用数据,以进一步减少I / O对存储系统的压力。本文探讨了这种解决方案带来的挑战,这些挑战促使人们需要新颖的技术来有效地检测和利用各组之间的通用数据。为此,我们引入了一种基于低开销指纹的方法,我们对该方法进行了评估,并证明了该方法对于许多节点和各种组配置上的典型情况在实践中是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号