首页> 外文会议>European simulation and modelling conference >PERFORMANCE ANALYSIS OF A 240 THREAD TOURNAMENT LEVEL MCTS GO PROGRAM ON THE INTEL XEON PHI
【24h】

PERFORMANCE ANALYSIS OF A 240 THREAD TOURNAMENT LEVEL MCTS GO PROGRAM ON THE INTEL XEON PHI

机译:240线程锦标赛级MCTS在英特尔Xeon Phi上的绩效分析

获取原文

摘要

In 2013 Intel introduced the Xeon Phi, a new parallel coprocessor board. The Xeon Phi is a cache-coherent many-core shared memory architecture claiming CPU-like versatility, programmability, high performance, and power efficiency. The first published micro-benchmark studies indicate that many of Intel's claims appear to be true. The current paper is the first study on the Phi of a complex artificial intelligence application. It contains an open source MCTS application for playing tournament quality Go (an oriental board game). We report the first speedup figures for up to 240 parallel threads on a real machine, allowing a direct comparison to previous simulation studies. After a substantial amount of work, we observed that performance scales well up to 32 threads, largely confirming previous simulation results of this Go program, although the performance surprisingly deteriorates between 32 and 240 threads. Furthermore, we report (1) unexpected performance anomalies between the Xeon Phi and Xeon CPU for small problem sizes and small numbers of threads, and (2) that performance is sensitive to scheduling choices. Achieving good performance on the Xeon Phi for complex programs is not straightforward; it requires a deep understanding (1) of search patterns, (2) of scheduling, and (3) of the architecture and its many cores and caches. In practice, the Xeon Phi is less straightforward to program for than originally envisioned by Intel.
机译:2013年英特尔推出了Xeon Phi,这是一个新的并联协处理器。 Xeon Phi是一种缓存连贯的多核共享内存架构,索取CPU的多功能性,可编程性,高性能和功率效率。第一个公布的微基准研究表明,英特尔的许多索赔似乎是真实的。目前的论文是对复杂人工智能应用的PHI第一研究。它包含一个开源MCTS应用程序,用于播放锦标赛质量(东方板游戏)。我们在真机上报告了最多240个平行线程的第一个加速数字,允许直接比较以前的模拟研究。经过大量的工作,我们观察到性能较好地缩放到32个线程,很大程度上确认了此Go程序的先前模拟结果,尽管在32到240个线程之间的性能令人惊讶地恶化。此外,我们在Xeon Phi和Xeon CPU之间报告(1)出乎意料的性能异常,用于小问题尺寸和少量线程,以及(2)性能对调度选择敏感。在Xeon Phi实现复杂计划的良好表现并不简单;它需要深入了解(1)的搜索模式,(2)调度,(3)的架构及其许多核心和高速缓存。在实践中,Xeon Phi比英特尔最初设想的计划不太直接。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号