Speech recognition using sub-word neural tree network models and multiple classifier fusion

Abstract

A new neural tree network (NTN)-based speech recognition system is presented. The NTN is a hierarchical classifier that combines the properties of decision trees and feed-forward neural networks. In the sub-word unit-based system, the NTNs model the sub-word speech segments, while the Viterbi algorithm is used for temporal alignment. A durational probability is associated with each sub-word NTN. An iterative algorithm is proposed for training the sub-word NTNs: the sub-word NTN models, as well as the sub-word segment boundaries within a vocabulary word, are re-estimated. Thus, the proposed system is a homogeneous neural network-based, sub-word unit-based speech recognition system. Furthermore, embedded within this word model paradigm, multiple NTNs are trained for each sub-word segment and their output decisions are combined, or fused, to yield improved performance. The proposed discriminatory training-based system did not perform favourably compared with a hidden Markov model-based system. The paradigm presented in this paper can be argued to represent a class of discriminatory training-based, homogeneous (versus hybrid), sub-word unit-based speech recognition systems. Hence, the results reported here can be generalized to other similar systems.
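The two mechanisms the abstract names, fusing the outputs of several classifiers and aligning frames to sub-word segments with the Viterbi algorithm, can be illustrated with a small sketch. This is not the authors' implementation: the NTNs are replaced by hand-written per-frame score tables, the fusion rule (a plain average) is an assumption, the durational probability is left out, and the names `fuse_scores` and `viterbi_align` are hypothetical.

```python
import numpy as np


def fuse_scores(score_sets):
    """Fuse several classifiers by averaging their per-frame scores.

    Averaging is an assumed fusion rule, chosen only for illustration.
    """
    return np.mean(np.stack(score_sets, axis=0), axis=0)


def viterbi_align(log_scores):
    """Left-to-right alignment of frames to an ordered list of segments.

    log_scores[t, s] is the log score of frame t under sub-word segment s.
    Segments are visited in order and each one covers at least one frame.
    Returns the segment index assigned to every frame.
    """
    n_frames, n_segments = log_scores.shape
    if n_frames < n_segments:
        raise ValueError("need at least one frame per segment")

    best = np.full((n_frames, n_segments), -np.inf)
    advanced = np.zeros((n_frames, n_segments), dtype=bool)
    best[0, 0] = log_scores[0, 0]  # the path must start in the first segment

    for t in range(1, n_frames):
        for s in range(n_segments):
            stay = best[t - 1, s]
            advance = best[t - 1, s - 1] if s > 0 else -np.inf
            if advance > stay:
                best[t, s] = advance + log_scores[t, s]
                advanced[t, s] = True
            else:
                best[t, s] = stay + log_scores[t, s]

    # Backtrace from the last segment at the last frame.
    path = np.empty(n_frames, dtype=int)
    s = n_segments - 1
    for t in range(n_frames - 1, -1, -1):
        path[t] = s
        if advanced[t, s]:
            s -= 1
    return path


# Made-up scores from two classifiers: 6 frames, 3 sub-word segments.
clf_a = np.array([[0.8, 0.1, 0.1], [0.7, 0.2, 0.1], [0.2, 0.7, 0.1],
                  [0.1, 0.8, 0.1], [0.1, 0.2, 0.7], [0.1, 0.1, 0.8]])
clf_b = np.array([[0.6, 0.3, 0.1], [0.6, 0.3, 0.1], [0.3, 0.6, 0.1],
                  [0.2, 0.6, 0.2], [0.2, 0.2, 0.6], [0.1, 0.2, 0.7]])

fused = fuse_scores([clf_a, clf_b])
path = viterbi_align(np.log(fused))
print(path.tolist())
```

The segment boundaries are the frames where `path` changes value. In an iterative scheme like the one the abstract describes, such boundaries would be used to relabel the training frames before the sub-word models are trained again.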

Bibliographic details

Source: 1995 | pp. 3323-3326 | 4 pages
Authors: Sharma, M.; Mammone, R.
Original format: PDF
Chinese Library Classification: Radio electronics, telecommunications technology

Similar literature

1. State-Clustering Based Multiple Deep Neural Networks Modeling Approach for Speech Recognition [J]. Zhou Pan, Jiang Hui, Dai Li-Rong. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2015, No. 4.
2. A Speaker-Dependent Approach to Single-Channel Joint Speech Separation and Acoustic Modeling Based on Deep Neural Networks for Robust Recognition of Multi-Talker Speech [J]. Yan-Hui Tu, Jun Du, Chin-Hui Lee. Journal of Signal Processing Systems for Signal, Image, and Video Technology, 2018, No. 7.
3. Audio-visual feature fusion via deep neural networks for automatic speech recognition [J]. Mohammad Hasan Rahmani, Farshad Almasganj, Seyyed Ali Seyyedsalehi. Digital Signal Processing, 2018.
4. Ensemble Classifier based on Decision-Fusion of Multiple Models for Speech Emotion Recognition [C]. Kyoungju Noh, Jiyoun Lim, Seungeun Chung. International Conference on Information and Communication Technology Convergence, 2018.
5. Modeling and learning in speech recognition: The relationship between stochastic pattern classifiers and neural networks [D]. Niles, Leslie Thomas. 1991.
6. Electroencephalography Based Fusion Two-Dimensional (2D)-Convolution Neural Networks (CNN) Model for Emotion Recognition System [O]. Yea-Hoon Kwon, Sae-Byuk Shin, Shin-Dug Kim. 2018.
7. A Neural Network Using Acoustic Sub-Word Units For Continuous Speech Recognition [O]. Ha-jin Yu, Yung-hwan Oh. 2007.
8. Neural Network Classifiers for Speech Recognition [R]. Lippman, R. S. 1988.
