首页> 外文期刊>Future generation computer systems >Adding domain data to code profiling tools to debug workflow parallel execution
【24h】

Adding domain data to code profiling tools to debug workflow parallel execution

机译:将域数据添加到代码分析工具以调试工作流并行执行

获取原文
获取原文并翻译 | 示例
           

摘要

Computer simulations may be composed of several scientific programs chained in a coherent flow running in High Performance Computing and cloud environments. These runs may present different execution behavior associated to the parallel flow of data among programs. Gather insight into the parallel flow of data is important for several applications. The usual way of getting insight into code performance is by means of a code-profiler. Several parallel code-profiling tools already support performance analysis, such as Tuning and Analysis Utilities (TAU), or provide Fine-grained performance statistics, e.g., System Activity Report (SAR). These tools are effective for code profiling, but are not connected to the concept of IO-intensive workflows. Analyzing the workflow execution with domain and performance data is important for users because they can identify anomalies, choose suitable machines to run their workflows, etc. This type of analysis may be performed by capturing execution data enriched with fine-grained domain data during the long-term run of a computer simulation. In this paper, we propose a monitoring data capture approach as a component that couples code-profiling tools to domain data from workflow executions. The goal is to profile and debug parallel executions of workflows through queries to a database that integrates performance, resource consumption, provenance, and domain data from simulation programs flow at runtime. We show how querying this database with domain-aware data at runtime allows to identify performance anomalies not detected by code-profiling tools. We evaluate our approach using the astronomy Montage workflow on a cluster environment and the SciPhy bioinformatics workflow on the Amazon cloud. In both cases computing time overhead imposed by our approach for gathering finegrained domain, performance, and resource consumption data is negligible.
机译:计算机模拟可以由在高性能计算和云环境中运行的连贯流动中链接的若干科学节目组成。这些运行可以呈现与程序之间的并行数据流相关的不同执行行为。收集洞察力对数据的并行流程对于若干应用很重要。通常介绍代码性能的通常方法是代码探查器。几个并行代码分析工具已经支持性能分析,例如调谐和分析公用事业(TAU),或提供细粒度的性能统计,例如系统活动报告(SAR)。这些工具对于代码分析有效,但没有连接到IO密集工作流的概念。分析使用域和性能数据的工作流程执行对于用户来说很重要,因为它们可以识别异常,选择合适的机器来运行其工作流等。这种类型的分析可以通过捕获长时间捕获有细粒域数据的执行数据来执行这种类型的分析 - 电脑仿真运行。在本文中,我们提出了一种监视数据捕获方法作为从工作流程执行的域数据耦合到域数据的组件。目标是通过对数据库的数据库进行配置文件和调试并行执行工作流程,该数据库集成了从运行时从仿真程序流中的模拟程序流程的性能,资源消耗,出种和域数据。我们展示了如何在运行时使用域感知数据查询此数据库允许识别代码分析工具未检测到的性能异常。我们评估了我们在群集环境中的天文蒙太奇工作流程和亚马逊云上的Sciphy Bioinfinomatics工作流程的方法。在两种情况下,通过我们采集Finegreatmormod域,性能和资源消耗数据所施加的计算时间开销可以忽略不计。

著录项

  • 来源
    《Future generation computer systems》 |2020年第9期|422-439|共18页
  • 作者单位

    Department of Computer Science Federal University of Rio de Janeiro/COPPE Brazil;

    Department of Computer Science Federal University of Rio de Janeiro/COPPE Brazil;

    Department of Computer Science Federal University of Rio de Janeiro/COPPE Brazil IBM Research Brazil;

    High Performance Computing Center and Department of Civil Engineering Federal University of Rio de Janeiro/COPPE Brazil;

    Institute of Computing Fluminense Federal University Brazil;

    Department of Computer Science Federal University of Rio de Janeiro/COPPE Brazil;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Performance analysis; Debugging; Scientific workflow; Provenance;

    机译:性能分析;调试;科学工作流程;来源;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号