首页>
外国专利>
Method and system for parallel statistical inference on highly parallel platforms
Method and system for parallel statistical inference on highly parallel platforms
展开▼
机译:在高度并行的平台上进行并行统计推断的方法和系统
展开▼
页面导航
摘要
著录项
相似文献
摘要
Methods for faster statistical inference in computation based recognition problems on highly parallel processors with multiple cores on-a-chip are disclosed, which include: selectively flattening levels of the recognition network to improve inference speed (improving the recognition model); selectively duplicating parts of the recognition network to minimize a critical section in atomic accesses to as few as one atomic instruction (improving the recognition procedure); and combining weight and source port into one 32-bit word to minimize the number of atomic operations. These methods have been implemented on an NVIDIA GTX 280 processor in a Large Vocabulary Continuous Speech Recognition (LVCSR) embodiment, and achieve more than a 10× speed up compared to a highly optimized sequential implementation on an Intel Core i7 processor.
展开▼