首页> 外文会议>2010 IEEE International Symposium on Parallel amp; Distributed Processing (IPDPS) >eScience in the cloud: A MODIS satellite data reprojection and reduction pipeline in the Windows Azure platform
【24h】

eScience in the cloud: A MODIS satellite data reprojection and reduction pipeline in the Windows Azure platform

机译:云中的电子科学:Windows Azure平台中的MODIS卫星数据重新投影和缩减管道

获取原文
获取原文并翻译 | 示例

摘要

The combination of low-cost sensors, low-cost commodity computing, and the Internet is enabling a new era of data-intensive science. The dramatic increase in this data availability has created a new challenge for scientists: how to process the data. Scientists today are envisioning scientific computations on large scale data but are having difficulty designing software architectures to accommodate the large volume of the often heterogeneous and inconsistent data. In this paper, we introduce a particular instance of this challenge, and present our design and implementation of a MODIS satellite data reprojection and reduction pipeline in the Windows Azure cloud computing platform. This cloud-based pipeline is designed with a goal of hiding data complexities and subsequent data processing and transformation from end users. This pipeline is highly flexible and extensible to accommodate different science data processing tasks, and can be dynamically scaled to fulfill scientists' various computational requirements in a cost-efficient way. Experiments show that by running a practical large-scale science data processing job in the pipeline using 150 moderately-sized Azure virtual machine instances, we were able to produce analytical results in nearly 90× less time than was possible with a high-end desktop machine. To our knowledge, this is one of the first eScience applications to use the Windows Azure platform.
机译:低成本传感器,低成本商品计算和Internet的结合,开创了数据密集型科学的新时代。数据可用性的急剧增加为科学家带来了新的挑战:如何处理数据。今天的科学家正在设想对大规模数据进行科学计算,但是在设计软件体系结构以适应大量通常是异构的和不一致的数据时遇到了困难。在本文中,我们介绍了此挑战的一个特定实例,并介绍了Windows Azure云计算平台中MODIS卫星数据重新投影和缩减管道的设计和实现。设计这种基于云的管道的目的是隐藏数据复杂性以及对最终用户的后续数据处理和转换。该管道具有高度的灵活性和可扩展性,可以适应不同的科学数据处理任务,并且可以动态扩展以经济高效的方式满足科学家的各种计算要求。实验表明,通过使用150个中等大小的Azure虚拟机实例在管道中运行实用的大规模科学数据处理工作,我们能够在近90倍的时间内生成分析结果,而与高端产品相比,时间更短台式机。据我们所知,这是最早使用Windows Azure平台的eScience应用程序之一。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号