A Rapid Model Adaptation Technique for Emotional Speech Recognition with Style Estimation Based on Multiple-Regression HMM

Yusuke IJIMA; Takashi NOSE; Makoto TACHIBANA; Takao KOBAYASHI

Abstract

In this paper, we propose a rapid model adaptation technique for emotional speech recognition which enables us to extract paralinguistic information as well as linguistic information contained in speech signals. This technique is based on style estimation and style adaptation using a multiple-regression HMM (MRHMM). In the MRHMM, the mean parameters of the output probability density function are controlled by a low-dimensional parameter vector, called a style vector, which corresponds to a set of the explanatory variables of the multiple regression. The recognition process consists of two stages. In the first stage, the style vector that represents the emotional expression category and the intensity of its expressiveness for the input speech is estimated on a sentence-by-sentence basis. Next, the acoustic models are adapted using the estimated style vector, and then standard HMM-based speech recognition is performed in the second stage. We assess the performance of the proposed technique in the recognition of simulated emotional speech uttered by both professional narrators and non-professional speakers.
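The two-stage process described above can be illustrated with a small numerical sketch. The abstract gives no equations, so everything here is an assumption made for illustration: each state is taken to have a single diagonal-covariance Gaussian whose mean is a linear regression on the style vector, mu = H [1, v], the state occupancy weights `gamma` are assumed to be given (for example from an alignment), and the style vector is estimated by the closed-form weighted least-squares solution that maximizes the Gaussian likelihood under those assumptions. The names `adapted_mean` and `estimate_style` are hypothetical and do not come from the paper.

```python
import numpy as np

def adapted_mean(H, v):
    """Mean of one Gaussian under the assumed regression model: mu = H @ [1, v]."""
    xi = np.concatenate(([1.0], v))
    return H @ xi

def estimate_style(obs, H_list, var_list, gamma):
    """Stage 1 (sketch): maximum-likelihood style vector for one sentence.

    obs:      (T, D) feature frames
    H_list:   list of (D, 1+L) regression matrices, one per state (assumed form)
    var_list: list of (D,) diagonal variances, one per state
    gamma:    (T, S) state occupancy weights, assumed to be given
    """
    L = H_list[0].shape[1] - 1
    lhs = np.zeros((L, L))
    rhs = np.zeros(L)
    occ = gamma.sum(axis=0)        # total occupancy per state, shape (S,)
    weighted_obs = gamma.T @ obs   # occupancy-weighted sum of frames, shape (S, D)
    for s, (H, var) in enumerate(zip(H_list, var_list)):
        b, A = H[:, 0], H[:, 1:]   # bias column and style-dependent columns
        At_P = A.T / var           # A^T Sigma^{-1} for a diagonal covariance
        lhs += occ[s] * (At_P @ A)
        rhs += At_P @ (weighted_obs[s] - occ[s] * b)
    return np.linalg.solve(lhs, rhs)

# Synthetic, noise-free data generated from a known style vector.
rng = np.random.default_rng(0)
D, L, S, T = 3, 2, 2, 20
H_list = [rng.normal(size=(D, 1 + L)) for _ in range(S)]
var_list = [np.ones(D) for _ in range(S)]
true_v = np.array([0.8, -0.3])
states = rng.integers(0, S, size=T)
gamma = np.eye(S)[states]          # hard (one-hot) state assignments
obs = np.stack([adapted_mean(H_list[s], true_v) for s in states])

# Stage 1: estimate the style vector for the sentence.
v_hat = estimate_style(obs, H_list, var_list, gamma)
# Stage 2: adapt every state's mean; a standard HMM decoder would then use these.
adapted = [adapted_mean(H, v_hat) for H in H_list]
```

Because only a low-dimensional vector is estimated per sentence, the linear system solved in stage 1 is small, which is one plausible reading of why the adaptation can be called rapid. The decoding step itself is omitted here.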

Bibliographic details

Source: IEICE Transactions on Information and Systems, 2010, No. 1, 9 pages
Authors: Yusuke IJIMA; Takashi NOSE; Makoto TACHIBANA; Takao KOBAYASHI
Original format: PDF
Chinese Library Classification: Radio electronics, telecommunications technology
