首页> 外文会议>ACMKDD International Conference on Knowledge Discovery and Data Mining;KDD 2008 >Data Mining Using High Performance Data Clouds: Experimental Studies Using Sector and Sphere
【24h】

Data Mining Using High Performance Data Clouds: Experimental Studies Using Sector and Sphere

机译:使用高性能数据云的数据挖掘:使用部门和领域的实验研究

获取原文

摘要

We describe the design and implementation of a high performance cloud that we have used to archive, analyze and mine large distributed data sets. By a cloud, we mean an infrastructure that provides resources and/or services over the Internet. A storage cloud provides storage services, while a compute cloud provides compute services. We describe the design of the Sector storage cloud and how it provides the storage services required by the Sphere compute cloud. We also describe the programming paradigm supported by the Sphere compute cloud. Sector and Sphere are designed for analyzing large data sets using computer clusters connected with wide area high performance networks (for example, 10+ Gb/s). We describe a distributed data mining application that we have developed using Sector and Sphere. Finally, we describe some experimental studies comparing Sector/Sphere to Hadoop.
机译:我们描述了用于存档,分析和挖掘大型分布式数据集的高性能云的设计和实现。云是指通过Internet提供资源和/或服务的基础架构。存储云提供存储服务,而计算云提供计算服务。我们描述了扇区存储云的设计以及它如何提供Sphere计算云所需的存储服务。我们还将描述Sphere计算云支持的编程范例。 Sector和Sphere旨在使用与广域高性能网络(例如10+ Gb / s)连接的计算机群集来分析大型数据集。我们描述了使用Sector和Sphere开发的分布式数据挖掘应用程序。最后,我们描述了一些将Sector / Sphere与Hadoop进行比较的实验研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号