首页> 外文会议> >FREERIDE-G: Supporting Applications that Mine Remote FREERIDE-G: Supporting Applications that Mine Remote

【24h】

FREERIDE-G: Supporting Applications that Mine Remote FREERIDE-G: Supporting Applications that Mine Remote

机译：FREERIDE-G：支持远程开采的应用程序FREERIDE-G：支持远程开采的应用程序

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Analysis of large geographically distributed scientific datasets, also referred to as distributed data-intensive science, has emerged as an important area in recent years. An application that processes data from a remote repository needs to be broken into several stages, including a data retrieval task at the data repository, a data movement task, and a data processing task at a computing site. Because of the volume of data that is involved and the amount of processing, it is desirable that both the data repository and computing site may be clusters. This can further complicate the development of such data processing applications. In this paper, we present a middleware, FREERIDE-G (framework for rapid implementation of datamining engines in grid), which support a high-level interface for developing data mining and scientific data processing applications that involve data stored in remote repositories. Particularly, we had the following goals behind designing the FREERIDE-G middleware: 1) support high-end processing, i.e., use parallel configurations for both hosting the data and processing the data, 2) ease use of parallel configurations, i.e., support a high-level API for specifying the processing, and 3) hide details of data movement and caching. We have evaluated our system using three popular data mining algorithms and two scientific data analysis applications. The main observations from our experiments are as follows. First, FREERIDE-G is able to scale the processing extremely well when the number of data server and compute nodes are scaled evenly. Second, when only the number of compute nodes are scaled, our target class of applications achieve modest additional speedups. Finally, for applications that involve multiple passes on the dataset, caching remote data provides significant improvement

机译：近年来，大型地理分布科学数据集（也称为分布式数据密集型科学）的分析已成为重要领域。处理来自远程存储库的数据的应用程序需要分为几个阶段，包括数据存储库中的数据检索任务，数据移动任务和计算站点中的数据处理任务。由于涉及的数据量和处理量大，因此希望数据存储库和计算站点都可以是群集。这会使这种数据处理应用程序的开发进一步复杂化。在本文中，我们提出了一种中间件，即FREERIDE-G（用于在网格中快速实现数据挖掘引擎的框架），该中间件支持用于开发数据挖掘和科学数据处理应用程序的高层接口，这些应用程序涉及存储在远程存储库中的数据。特别是，在设计FREERIDE-G中间件后，我们有以下目标：1）支持高端处理，即使用并行配置来托管数据和处理数据; 2）简化并行配置的使用，即支持用于指定处理的高级API，以及3）隐藏数据移动和缓存的详细信息。我们已经使用三种流行的数据挖掘算法和两个科学数据分析应用程序评估了我们的系统。我们的实验的主要观察结果如下。首先，当数据服务器和计算节点的数量均匀扩展时，FREERIDE-G能够非常好地扩展处理能力。其次，当仅扩展计算节点的数量时，我们的目标应用程序类别将实现适度的额外加速。最后，对于涉及数据集多次遍历的应用程序，缓存远程数据可显着改善

著录项

来源
《》|2006年|109-118|共10页
会议地点
作者
Leonid Glimcher; Ruoming Jin; Gagan Agrawal;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
data mining; grid computing; middleware; natural sciences computing; very large databases; FREERIDE-G middleware; high-end processing; large geographically distributed scientific datasets; parallel configurations; remote data repositories mining; scientific data an;

机译：数据挖掘;网格计算;中间件;自然科学计算;非常大的数据库; FREERIDE-G中间件;高端处理;大型地理分布的科学数据集;并行配置;远程数据仓库挖掘;科学数据和;

相似文献

外文文献
中文文献
专利

1. The Applications of Satellite Based Remote Sensing Techniques in the Hydrological Assessment of Mine Water Supply and Management Systems [J] . Mei Lin Shelp, Guosheng Zhan, Bill Upton Mine water and the environment . 2011,第4期

机译：卫星遥感技术在矿井供水管理系统水文评价中的应用
2. Tested and Proved in Western U.S. Mine Operations, P&H~R PreVail~R Remote Health Monitoring Technology Eyed for P&H Mining Shovel Applications Worldwide [J] . Mining Engineering . 2010,第9期

机译：P＆H〜R PreVail〜R远程健康监控技术已在美国西部矿山运营中进行了测试和验证，可望在全球的P＆H采矿铲应用中使用
3. UAS Remote Sensing Products for Supporting Extraction Management and Restoration Monitoring in Open-Pit Mines [J] . Vicen? Carabassa, Pau Montero, Marc Crespo, Proceedings . 2019,第1期

机译：UAS遥感产品，用于支持露天矿区的提取管理和恢复监测
4. FREERIDE-G: Supporting Applications that Mine Remote FREERIDE-G: Supporting Applications that Mine Remote [C] . Leonid Glimcher, Ruoming Jin, Gagan Agrawal International Conference on Parallel Processing . 2006

机译：Freeride-G：支持挖掘远程Freeride-g的应用程序：支持迈出遥控器的应用程序
5. Applications of multi-season hyperspectral remote sensing for acid mine water characterization and mapping of secondary iron minerals associated with acid mine drainage. [D] . Davies, Gwendolyn E. 2015

机译：多季节高光谱遥感在酸性矿山水特征描述和与酸性矿山排水相关的次生铁矿物测绘中的应用。
6. Determining spatio-temporal distribution of bee forage species of Al-Baha region based on ground inventorying supported with GIS applications and Remote Sensed Satellite Image analysis [O] . Nuru Adgaba, Ahmed Alghamdi, Rachid Sammoud, 2017

机译：基于GIS应用程序和遥感卫星图像分析支持的地面清点确定Al-Baha地区的蜜蜂觅食物种的时空分布
7. Bluetooth Beacon-Based Mine Production Management Application to Support Ore Haulage Operations in Underground Mines [O] . Sebeom Park, Yosoon Choi 2021

机译：基于蓝牙灯塔的矿山生产管理应用，以支持地下矿山的矿石运营
8. Remotely Placed Concrete/Gravel Columns for Point Support with InnovativeConcrete Placement Device and a Pneumatic Feeder for Remote Sealing and Support of Abandoned Mines [R] . Burnett, M., El-Korchi, T., Burnett, J. M. 1993

机译：用于点支撑的远程放置混凝土/砾石柱，带有InnovativeConcrete放置装置和用于远程密封和支撑废弃矿井的气动进料器

FREERIDE-G: Supporting Applications that Mine Remote FREERIDE-G: Supporting Applications that Mine Remote

摘要

著录项

相似文献

相关主题

期刊订阅