首页> 外文会议> >Speech recognition using sub-word neural tree network models and multiple classifier fusion
【24h】

Speech recognition using sub-word neural tree network models and multiple classifier fusion

机译:使用子词神经树网络模型和多分类器融合的语音识别

获取原文

摘要

A new neural tree network (NTN)-based speech recognition system is presented. NTN is a hierarchial classifier that combines the properties of decision trees and feed-forward neural networks. In the sub-word unit-based system, the NTNs model the sub-word speech segments, while the Viterbi algorithm is used for temporal alignment. Durational probability is associated with each sub-word NTN. An iterative algorithm is proposed for training the sub-word NTNs. The sub-word NTN models, as well as the subword segment boundaries within a vocabulary word, are re-estimated. Thus, the proposed system is a homogeneous neural network-based, sub-word unit-based, speech recognition system. Furthermore, embedded within this word model paradigm, multiple NTNs are trained for each subword segment and their output decisions are combined or fused to yield improved performance. The proposed discriminatory training-based system did not perform favourably as compared to a hidden Markov model-based system. The paradigm presented in this paper can be argued to represent a class of discriminatory training-based, homogeneous (versus hybrid), sub-word unit-based, speech recognition systems. Hence, the results reported here can be generalized to other similar systems.
机译:提出了一种新的基于神经树网络(NTN)的语音识别系统。 NTN是一种分层分类器,结合了决策树和前馈神经网络的属性。在基于子词单元的系统中,NTN为子词语音片段建模,而维特比算法用于时间对齐。持续时间概率与每个子词NTN相关联。提出了一种迭代算法来训练子词NTN。重新估计子词NTN模型以及词汇词中的子词片段边界。因此,所提出的系统是基于同类神经网络,基于子词单元的语音识别系统。此外,嵌入到此单词模型范式中的每个子单词片段都训练了多个NTN,并且将它们的输出决策进行组合或融合以提高性能。与基于隐马尔可夫模型的系统相比,所提出的基于歧视性训练的系统表现不佳。本文提出的范式可以被认为代表了一类基于歧视性训练的,同质(相对于混合),基于子词单元的语音识别系统。因此,此处报告的结果可以推广到其他类似系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号