...
首页> 外文期刊>Parallel and Distributed Systems, IEEE Transactions on >An Online Data Access Prediction and Optimization Approach for Distributed Systems
【24h】

An Online Data Access Prediction and Optimization Approach for Distributed Systems

机译:分布式系统的在线数据访问预测与优化方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Current scientific applications have been producing large amounts of data. The processing, handling and analysis of such data require large-scale computing infrastructures such as clusters and grids. In this area, studies aim at improving the performance of data-intensive applications by optimizing data accesses. In order to achieve this goal, distributed storage systems have been considering techniques of data replication, migration, distribution, and access parallelism. However, the main drawback of those studies is that they do not take into account application behavior to perform data access optimization. This limitation motivated this paper which applies strategies to support the online prediction of application behavior in order to optimize data access operations on distributed systems, without requiring any information on past executions. In order to accomplish such a goal, this approach organizes application behaviors as time series and, then, analyzes and classifies those series according to their properties. By knowing properties, the approach selects modeling techniques to represent series and perform predictions, which are, later on, used to optimize data access operations. This new approach was implemented and evaluated using the OptorSim simulator, sponsored by the LHC-CERN project and widely employed by the scientific community. Experiments confirm this new approach reduces application execution time in about 50 percent, specially when handling large amounts of data.
机译:当前的科学应用已产生大量数据。此类数据的处理,处理和分析需要大规模的计算基础架构,例如集群和网格。在这一领域,研究旨在通过优化数据访问来提高数据密集型应用程序的性能。为了实现此目标,分布式存储系统一直在考虑数据复制,迁移,分发和访问并行性的技术。但是,这些研究的主要缺点是,它们没有考虑应用程序行为来执行数据访问优化。这种局限性促使本文采用策略来支持对应用程序行为的在线预测,以便优化分布式系统上的数据访问操作,而无需任何有关过去执行的信息。为了实现此目标,此方法将应用程序行为组织为时间序列,然后根据这些序列的属性对其进行分析和分类。通过了解属性,该方法选择建模技术来表示序列并执行预测,这些预测随后将用于优化数据访问操作。该新方法是由LHC-CERN项目赞助并被科学界广泛采用的OptorSim模拟器实施和评估的。实验证实,这种新方法可将应用程序执行时间减少约50%,特别是在处理大量数据时。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号