首页> 外文OA文献 >Performance-driven refactoring of Potts associative memory network model
【2h】

Performance-driven refactoring of Potts associative memory network model

机译:性能驱动的Potts关联内存网络模型重构

摘要

Neural networks simulations have always been a complex computational chal-udlenge because of the requirements of large amount of computational andudmemory resources. Due to the nature of the problem, a high performanceudcomputing approach becomes vital, because the dynamics often involves theudupdate of a large network for a large number of time steps. Moreover, theudparameter space can be fairly large. An advanced optimization for the singleudtime step is therefore necessary, as well as a strategy to explore the parameterudspace in an automatic fashion.udThis work rst examines the purely serial original code, identifying itsudbottlenecks and ine cient design choices. After that, several optimizationsudstrategies are presented and discussed, exploiting vectorization, e cient mem-udory access and cache usage. The strategies are presented together with anudextensive set of the benchmarks and a detailed discussion of all the issuesudencountered.udThe nal part of the work is the design of a high throughput approachudto the paramenter sweep, necessary to explore the behaviour of the network.udThis is implemented by means of a task manager that takes care of runningudsimulations from a batch of prede ned runs in an automatic way and collectsudtheir results. A detailed performance analysis of the task manager is reported.udThe results of the work show a consistent speed up for the single-runudcase, and a massive productivity improvement thanks to the task-manager.udMoreover, the code base is now reorganized to favor extensibility and codeudreuse, allowing the application of several of the present strategies to otherudproblems as well.
机译:由于需要大量的计算和内存资源,因此神经网络仿真一直是一个复杂的计算挑战。由于问题的性质,高性能 udcomputing方法变得至关重要,因为动力学经常涉及对大型网络进行大量时间步长的 udupdate。此外, udparameter空间可以相当大。因此,有必要对单个 udtime步骤进行高级优化,以及采取一种自动探索参数 udspace的策略。 ud这项工作首先检查了纯串行原始代码,确定了其 udbottle瓶颈和无效的设计选择。之后,提出并讨论了几种优化方法/策略,它们利用了向量化,有效的内存/伪造访问和缓存使用。提出了这些策略,同时还提供了一组泛泛的基准测试以及对所有问题的详细讨论遇到过 ud。工作的最后一部分是设计高吞吐量方法 ud进行参数扫描,这对于探索行为是必不可少的。 ud这是通过任务管理器来实现的,该任务管理器自动处理一批预定运行中的运行 udsim,并收集 udthe结果。报告了对任务管理器的详细性能分析。 ud工作结果表明,单次运行 udcase的速度始终如一,并且由于任务管理器而大大提高了生产率。 ud此外,现在的代码库重新组织以支持可扩展性和代码重用,从而允许将当前策略中的一些应用到其他 udproblem。

著录项

  • 作者

  • 作者单位
  • 年度 2015
  • 总页数
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号