首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Refinements of regression-based context-dependent modelling of deep neural networks for automatic speech recognition
【24h】

Refinements of regression-based context-dependent modelling of deep neural networks for automatic speech recognition

机译:用于自动语音识别的基于回归的深度神经网络建模的改进

获取原文
获取外文期刊封面目录资料

摘要

The data sparsity problem of context-dependent (CD) acoustic modelling of deep neural networks (DNNs) in speech recognition is addressed by using the decision tree state clusters as the training targets. The CD states within a cluster cannot be distinguished during decoding. This problem, referred to as the clustering problem, is not explicitly addressed in the current literature. In our previous work, a regression-based CD-DNN framework was proposed to address both the data sparsity and the clustering problems. This paper investigates several refinements for the regression-based CD-DNN including two more representative state approximation schemes and the incorporation of sequential learning. The two approximations are obtained based on the statistics learned from the training data. Sequential learning is applied to both broad phone DNN detectors and the regression NN. The proposed refinements are evaluated on a broadcast news transcription task. For the cross-entropy systems, the two approximations perform consistently better than our previous work. Consistent performance gain over the corresponding cross-entropy trained systems is also observed for both the baseline CD-DNN and the regression model with sequential learning.
机译:通过使用决策树状态簇作为训练目标,解决了语音识别中深层神经网络(DNN)的上下文相关(CD)声学建模的数据稀疏性问题。在解码期间无法区分群集内的CD状态。这个问题,称为聚类问题,在当前文献中没有明确解决。在我们之前的工作中,提出了一种基于回归的CD-DNN框架来解决数据稀疏性和聚类问题。本文研究了基于回归的CD-DNN的几种改进,包括两个更具代表性的状态逼近方案和顺序学习的结合。这两个近似值是根据从训练数据中学到的统计数据获得的。顺序学习适用于广泛的电话DNN检测器和回归NN。在广播新闻转录任务上对提出的改进方案进行了评估。对于交叉熵系统,两个近似的性能始终优于我们之前的工作。对于基线CD-DNN和具有顺序学习的回归模型,在相应的交叉熵训练的系统上也观察到一致的性能提升。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号