Hermitian Polynomial for Speaker Adaptation of Connectionist Speech Recognition Systems

Siniscalchi; S.M.; Li; J.; Lee; C.-H.

首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Hermitian Polynomial for Speaker Adaptation of Connectionist Speech Recognition Systems

【24h】

Hermitian Polynomial for Speaker Adaptation of Connectionist Speech Recognition Systems

机译：埃尔米特多项式用于连接主义语音识别系统的说话人适应

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Model adaptation techniques are an efficient way to reduce the mismatch that typically occurs between the training and test condition of any automatic speech recognition (ASR) system. This work addresses the problem of increased degradation in performance when moving from speaker-dependent (SD) to speaker-independent (SI) conditions for connectionist (or hybrid) hidden Markov model/artificial neural network (HMM/ANN) systems in the context of large vocabulary continuous speech recognition (LVCSR). Adapting hybrid HMM/ANN systems on a small amount of adaptation data has been proven to be a difficult task, and has been a limiting factor in the widespread deployment of hybrid techniques in operational ASR systems. Addressing the crucial issue of speaker adaptation (SA) for hybrid HMM/ANN system can thereby have a great impact on the connectionist paradigm, which will play a major role in the design of next-generation LVCSR considering the great success reported by deep neural networks—ANNs with many hidden layers that adopts the pre-training technique—on many speech tasks. Current adaptation techniques for ANNs based on injecting an adaptable linear transformation network connected to either the input, or the output layer are not effective especially with a small amount of adaptation data, e.g., a single adaptation utterance. In this paper, a novel solution is proposed to overcome those limits and make it robust to scarce adaptation resources. The key idea is to adapt the hidden activation functions rather than the network weights. The adoption of Hermitian activation functions makes this possible. Experimental results on an LVCSR task demonstrate the effectiveness of the proposed approach.

机译：模型自适应技术是减少通常在任何自动语音识别（ASR）系统的训练和测试条件之间发生的不匹配的有效方法。这项工作解决了在以下情况下，当连接者（或混合）隐马尔可夫模型/人工神经网络（HMM / ANN）系统从与说话者相关的（SD）状态变为与说话者无关的（SI）条件时，性能下降的问题。大词汇量连续语音识别（LVCSR）。事实证明，在少量适应数据上适应混合HMM / ANN系统是一项艰巨的任务，并且已成为在运营ASR系统中广泛使用混合技术的限制因素。因此，解决混合HMM / ANN系统的说话人自适应（SA）的关键问题可能会对连接主义范式产生重大影响，考虑到深度神经网络报告的巨大成功，这将在下一代LVCSR的设计中发挥重要作用在许多语音任务上，采用预训练技术的具有许多隐藏层的人工神经网络。基于注入连接到输入层或输出层的自适应线性变换网络的用于ANN的当前自适应技术尤其在使用少量自适应数据（例如单个自适应话语）的情况下无效。在本文中，提出了一种新颖的解决方案来克服这些限制并使其对稀缺的适应资源具有鲁棒性。关键思想是调整隐藏的激活功能而不是网络权重。采用Hermitian激活功能可以实现这一点。 LVCSR任务的实验结果证明了该方法的有效性。

著录项

来源
《Audio, Speech, and Language Processing, IEEE Transactions on》 |2013年第10期|2152-2161|共10页
作者
Siniscalchi; S.M.; Li; J.; Lee; C.-H.;
展开▼
作者单位

Department of Computer Engineering, Kore University of Enna, Enna, Italy|c|;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Artificial neural networks; model adaptation; speech processing;

机译：人工神经网络;模型自适应;语音处理;

相似文献

外文文献
中文文献
专利

1. Speaker clustering and transformation for speaker adaptation in speech recognition systems [J] . Padmanabhan M., Bahl L.R. IEEE Transactions on Speech and Audio Proceeding . 1998,第1期

机译：语音识别系统中的说话人适应和说话人聚类和转换
2. An Unsupervised Speaker Adaptation Method for Lecture-Style Spontaneous Speech Recognition Using Multiple Recognition Systems [J] . Seiichi NAKAGAWA, Tomohiro WATANABE, Hiromitsu NISHIZAKI, IEICE Transactions on Information and Systems . 2005,第3期

机译：基于多重识别系统的演讲风格自发语音识别的无监督说话人自适应方法
3. Air traffic control speech recognition system cross-task & speaker adaptation [J] . de Cordoba R., Ferreiros J., San-Segundo R., IEEE Aerospace and Electronic Systems Magazine . 2006,第9期

机译：空中交通管制语音识别系统跨任务和说话者自适应
4. Speaker adaptation for hybrid MMI/connectionist speech-recognition systems [C] . Rottland, J., Neukirchen, . 1998

机译：混合MMI /连接主义语音识别系统的说话人适应
5. Speaker Characteristic-based Acoustic Model Adaptation Method for Speaker Recognition Systems [D] . Millington, Daniel S. 2011

机译：基于说话者特征的说话人识别系统声学模型自适应方法
6. Regularized Speaker Adaptation of KL-HMM for Dysarthric Speech Recognition [O] . Myungjong Kim, Younggwan Kim, Joohong Yoo, -1

机译：KL-HMM的正则化说话人适应用于音调异常语音识别
7. SPEAKER ADAPTATION FOR HYBRID MMI / CONNECTIONIST SPEECH RECOGNITION SYSTEMS [O] . 2008

机译：混合MMI / CONNECTIONIST语音识别系统的扬声器自适应

Hermitian Polynomial for Speaker Adaptation of Connectionist Speech Recognition Systems

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅