首页> 外文会议>International Conference on Service-Oriented Computing >Building Data-Intensive Grid Applications with Globus Toolkit - An Evaluation Based on Web Crawling
【24h】

Building Data-Intensive Grid Applications with Globus Toolkit - An Evaluation Based on Web Crawling

机译:使用Globus Toolkit构建数据密集电网应用程序 - 基于Web爬网的评估

获取原文
获取外文期刊封面目录资料

摘要

Nowadays, there is a trend to create resource-consuming applications without building heavy computer centers, but to use resources on computer systems distributed over the internet. Grid middleware is a framework to access these resources. The concern of this paper is the evaluation of a specific grid middleware, namely Globus Toolkit, for data-intensive applications. As a test case, we have designed and implemented a service-based distributed web crawler on top of this middleware: A web crawler is a complex application consisting of many nodes. It imposes significantly higher demands on grid middleware regarding administrative flexibility compared to grid applications that allocate computing power of grid nodes. We have observed that some components of Globus Toolkit are flexible enough to provide the control functionality necessary for a web crawler, while others are not. For these other components, we propose possible extensions. Since we expect the combination of those characteristics to occur with many other grid applications as well, our study is of broader interest, beyond web crawling.
机译:如今,在没有构建重型计算机中心的情况下创建资源消耗的应用程序,但在互联网上分布的计算机系统上使用资源。网格中间件是访问这些资源的框架。本文的关注是评估特定网格中间件,即Globus Toolkit,用于数据密集型应用。作为一个测试用例,我们在这个中间件的顶部设计并实现了一个基于服务的分布式Web爬虫:Web爬网程序是一个复杂的应用程序,包括许多节点。与分配网格节点的计算能力的网格应用相比,它对网格中间件对网格中间件的需求显着提高了。我们已经观察到Globus Toolkit的某些组件足够灵活,可以为Web爬网程序提供所需的控制功能,而其他组件则不是。对于这些其他组件,我们提出了可能的扩展。由于我们期望这些特征的组合也与许多其他电网应用程序发生,我们的研究是更广泛的兴趣,超越网络爬行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号