
An Efficient Architecture for Dynamic Profiling of Multicore Systems



Abstract

Application profiling is an important step in the design and optimization of embedded systems. Accurately identifying and analyzing the execution of frequently executed computational kernels is needed to effectively optimize the system implementation, both at design time and at runtime. In a traditional design process, it suffices to perform the profiling and optimization steps offline, during design time. The offline profiling guides the design space exploration, hardware/software codesign, or power and performance optimizations. When the system implementation can be finalized at design time, this approach works well. However, dynamic optimization techniques, which adapt and reconfigure the system at runtime, require dynamic profiling with minimal runtime overhead. Existing profiling methods are usually software based and incur significant overheads that may be prohibitive or impractical for profiling embedded systems at runtime. In addition, these profiling methods typically focus on profiling specific tasks executing on a single processor core and do not consider accurate and holistic profiling across multiple processor cores. Directly utilizing existing profiling approaches and naively combining isolated profiles from multiple processor cores can lead to significant profile inaccuracies of up to 35%. To address these challenges, a hardware-based dynamic application profiler for non-intrusively and accurately profiling software applications in multicore embedded systems is presented. The profiler provides a detailed execution profile for computational kernels and maintains profile accuracy across multiple processor cores. The hardware-based profiler achieves an average error of less than 0.5% for the percentage execution time of profiled applications while being area efficient.
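To make the multicore accuracy concern concrete, the following is a minimal sketch of one way that combining isolated per-core profiles could distort a kernel's percentage execution time. It is an illustration only, not the method or data from the paper: the kernel names (`fir`, `fft`), the cycle counts, and the choice of "average the per-core percentages" as the naive combination are all assumptions made for this example.

```python
from collections import defaultdict


def per_core_percent(core):
    """Percentage of one core's cycles spent in each kernel."""
    total = sum(core.values())
    return {kernel: 100.0 * cycles / total for kernel, cycles in core.items()}


def naive_combined_percent(cores):
    """Assumed naive combination: average each kernel's per-core percentage,
    ignoring how many cycles each core actually executed."""
    sums = defaultdict(float)
    for core in cores:
        for kernel, pct in per_core_percent(core).items():
            sums[kernel] += pct
    return {kernel: s / len(cores) for kernel, s in sums.items()}


def holistic_percent(cores):
    """System-wide view: sum cycles per kernel across all cores first,
    then divide by the total cycles of the whole system."""
    totals = defaultdict(int)
    for core in cores:
        for kernel, cycles in core.items():
            totals[kernel] += cycles
    grand_total = sum(totals.values())
    return {kernel: 100.0 * c / grand_total for kernel, c in totals.items()}


# Hypothetical cycle counts per kernel (made up for illustration).
cores = [
    {"fir": 900, "fft": 100},    # core 0 ran 1000 cycles in total
    {"fir": 100, "fft": 3900},   # core 1 ran 4000 cycles in total
]

naive = naive_combined_percent(cores)
holistic = holistic_percent(cores)

for kernel in sorted(holistic):
    print(f"{kernel}: naive {naive[kernel]:.2f}%  holistic {holistic[kernel]:.2f}%")
```

With these invented inputs the two views disagree sharply because the cores execute very different numbers of cycles. The gap here is an artifact of the chosen numbers and should not be read as reproducing the 35% figure quoted in the abstract.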
