首页> 外国专利> Deep convex network with joint use of nonlinear random projection, Restricted Boltzmann Machine and batch-based parallelizable optimization

Deep convex network with joint use of nonlinear random projection, Restricted Boltzmann Machine and batch-based parallelizable optimization

机译：结合使用非线性随机投影，受限玻尔兹曼机和基于批处理的可并行优化的深凸网络

页面导航

摘要
著录项
相似文献

摘要

A method is disclosed herein that includes an act of causing a processor to access a deep-structured, layered or hierarchical model, called deep convex network, retained in a computer-readable medium, wherein the deep-structured model comprises a plurality of layers with weights assigned thereto. This layered model can produce the output serving as the scores to combine with transition probabilities between states in a hidden Markov model and language model scores to form a full speech recognizer. The method makes joint use of nonlinear random projections and RBM weights, and it stacks a lower module's output with the raw data to establish its immediately higher module. Batch-based, convex optimization is performed to learn a portion of the deep convex network's weights, rendering it appropriate for parallel computation to accomplish the training. The method can further include the act of jointly substantially optimizing the weights, the transition probabilities, and the language model scores of the deep-structured model using the optimization criterion based on a sequence rather than a set of unrelated frames.

机译：本文公开了一种方法，该方法包括使处理器访问保留在计算机可读介质中的称为深凸网络的深结构，分层或分层模型的动作，其中该深结构模型包括具有以下特征的多个层：分配给它们的权重。该分层模型可以产生用作得分的输出，并与隐藏的马尔可夫模型中的状态之间的转换概率和语言模型得分相结合，以形成完整的语音识别器。该方法结合使用非线性随机投影和RBM权重，并将较低模块的输出与原始数据堆叠在一起，以建立其紧邻的较高模块。执行基于批次的凸优化，以了解深凸网络权重的一部分，使其适合于并行计算以完成训练。该方法可以进一步包括以下动作：基于序列而不是一组不相关的帧，使用最优化准则来共同基本优化深度结构化模型的权重，过渡概率和语言模型得分。

著录项

公开/公告号US8489529B2

专利类型
公开/公告日2013-07-16

原文格式PDF
申请/专利权人 LI DENG;DONG YU;ALEJANDRO ACERO;
展开▼

申请/专利号US201113077978
发明设计人 LI DENG;DONG YU;ALEJANDRO ACERO;
展开▼

申请日2011-03-31
分类号G06N5;
国家 US
入库时间 2022-08-21 16:46:22

相似文献

专利
外文文献
中文文献