首页> 外国专利> Method and system for parallel statistical inference on highly parallel platforms

Method and system for parallel statistical inference on highly parallel platforms

机译:在高度并行的平台上进行并行统计推断的方法和系统

摘要

Methods for faster statistical inference in computation based recognition problems on highly parallel processors with multiple cores on-a-chip are disclosed, which include: selectively flattening levels of the recognition network to improve inference speed (improving the recognition model); selectively duplicating parts of the recognition network to minimize a critical section in atomic accesses to as few as one atomic instruction (improving the recognition procedure); and combining weight and source port into one 32-bit word to minimize the number of atomic operations. These methods have been implemented on an NVIDIA GTX 280 processor in a Large Vocabulary Continuous Speech Recognition (LVCSR) embodiment, and achieve more than a 10× speed up compared to a highly optimized sequential implementation on an Intel Core i7 processor.
机译:公开了用于在具有多个芯片上芯片的高度并行处理器上的基于计算的识别问题中更快的统计推理的方法,该方法包括:选择性地使识别网络的水平平坦化以提高推理速度(改进识别模型);有选择地复制识别网络的各个部分,以使原子访问中的关键部分最小化至少至一个原子指令(改进了识别程序);并将权重和源端口组合为一个32位字,以最大程度地减少原子操作的次数。这些方法已经在大型词汇连续语音识别(LVCSR)实施例中的NVIDIA GTX 280处理器上实现,并且与在Intel Core i7处理器上高度优化的顺序实现相比,实现了10倍以上的加速。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号