首页> 外文会议>IASTED International Conference on Parallel and Distributed Computing and Systems >Performance Evaluation of SNPs Machine-Learning Workload on Intelreg; Pentiumreg; 4 Hyper-Threading Architectures
【24h】

Performance Evaluation of SNPs Machine-Learning Workload on Intelreg; Pentiumreg; 4 Hyper-Threading Architectures

机译:英特尔&reg上SNPS机器学习工作量的性能评估; Pentium® 4超线程架构

获取原文

摘要

This paper analyzes a Pentium 4 hyper-threading processor and a Pentium 4 hyper-threading processor on 90nm technology with a machine learning workload parallelized with OpenMP~* and Intel compiler. The focus is to understand SNPs performance and the underlying reasons behind that performance. The particular attention is paid to micro-architecture metrics and comparison to examine and evaluate, where appropriate, how those two types of processors perform relative to expectation on SNP machine learning workloads. Results include parallel speedup, micro-architecture metrics comparison.
机译:本文分析了奔腾4个超线程处理器和90nm技术的奔腾4个超线程处理器,使用与OpenMP〜*和英特尔编译器并行化的机器学习工作负载。 重点是了解SNPS性能和表现背后的基本原因。 在适当的情况下,将特别注意微型建筑指标和比较,以便在适当的情况下检查和评估这些类型的处理器如何相对于SNP机器学习工作负载的期望。 结果包括并行加速,微架构度量比较。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号