首页> 外文会议> >Fast speaker adaptation combined with soft vector quantization in an HMM speech recognition system

【24h】

Fast speaker adaptation combined with soft vector quantization in an HMM speech recognition system

机译：HMM语音识别系统中的快速说话人自适应与软矢量量化相结合

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The authors describe a method for combining speaker adaptation by feature vector transformation with semi-continuous hidden Markov modeling (SCHMM). Since the reference speaker's voice is represented in the SCHMM system by multidimensional Gaussian distributions, it is these distributions rather than feature vectors that must be transformed. The performance of hard-decision vector quantization (HVQ), soft-decision VQ (SVQ), and SCHMM are compared as are the speaker-adaptive and speaker-independent systems. In addition, the influence of dynamic features is investigated. The definition of subword units is optimized, and, with respect to full or diagonal covariance matrices and codebook size, the SCHMM system is optimized. Model initialization and distribution reestimation during training is introduced. Significant improvements are obtained compared to previously reported systems based on HVQ: from 71.6% to 84.6% (speaker-independent) and from 80.4% to 87.4% (speaker-adaptive) mean recognition rate under difficult conditions.

机译：作者描述了一种通过特征向量变换将说话人自适应与半连续隐马尔可夫建模（SCHMM）相结合的方法。由于参考说话人的声音在SCHMM系统中是通过多维高斯分布表示的，因此必须转换这些分布而不是特征向量。比较了硬决策矢量量化（HVQ），软决策VQ（SVQ）和SCHMM的性能，以及说话人自适应和独立于说话者的系统。此外，还研究了动态特征的影响。优化子字单元的定义，并且针对完整或对角协方差矩阵和码本大小，对SCHMM系统进行优化。介绍了训练期间的模型初始化和分布重新估计。与以前基于HVQ的报告系统相比，获得了显着改进：在困难条件下的平均识别率从71.6％提高到84.6％（独立于说话者），从80.4％提高到87.4％（独立于说话者）。

著录项

来源
《》|1992年|P.461-464|共4页
会议地点
作者
Class; F.; Kaltenmeir; A.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. Optimum Hmm Combined With Vector Quantization For Hindi Speech Word Recognition [J] . Poonam Bansal, Amita Dev and Shail Bala Jain IETE Journal of Research . 2008,第4期

机译：最优Hmm与矢量量化相结合的印地语语音单词识别
2. Improved Vector Quantization Approach for Discrete HMM Speech Recognition System [J] . Maryam Madani, Shadpour Mallakpour The international arab journal of information technology . 2006,第4期

机译：离散HMM语音识别系统的改进矢量量化方法
3. Speech-Input Speech-Output Communication for Dysarthric Speakers Using HMM-Based Speech Recognition and Adaptive Synthesis System [J] . Dhanalakshmi M., Celin T. A. Mariya, Nagarajan T., Circuits, systems, and signal processing . 2018,第2期

机译：基于HMM的语音识别和自适应合成系统的韵律演讲者的语音输入语音输出通信
4. Fast speaker adaptation combined with soft vector quantization in an HMM speech recognition system [C] . Class F., Kaltenmeir A., Institute of Electric and Electronic Engineer IEEE International Conference on Acoustics, Speech, and Signal Processing . 1992

机译：快速扬声器适配与嗯语音识别系统中的软矢量量化结合
5. A neural predictive HMM architecture for speech and speaker recognition. [D] . Hassanein, Khaled Saad. 1994

机译：用于语音和说话者识别的神经预测HMM架构。
6. Regularized Speaker Adaptation of KL-HMM for Dysarthric Speech Recognition [O] . Myungjong Kim, Younggwan Kim, Joohong Yoo, -1

机译：KL-HMM的正则化说话人适应用于音调异常语音识别
7. A study of LSF representation for speaker dependent and speaker independent HMM-based speech recognition systems [O] . K. K. Paliwal 1990

机译：基于说话人和与说话人无关的基于HMM的语音识别系统的LSF表示研究

Fast speaker adaptation combined with soft vector quantization in an HMM speech recognition system

摘要

著录项

相似文献

相关主题

期刊订阅