The purpose of this paper is to describe the realization of speech emotion recognition. Speech emotion recognition has generally been performed in a text-independent mode, so previous research has overlooked the fact that emotion features vary with the text or phonemes, even though this can degrade classification performance. To overcome this distortion, a framework for speech emotion recognition based on the segmentation of voiced and unvoiced sound is proposed. Because vocalization differs substantially between voiced and unvoiced sound, their emotion features have different characteristics, and the two should therefore be considered separately. In this paper, voiced/unvoiced classification is performed using the spectral flatness measure and the spectral center, and a Gaussian mixture model with five mixtures is employed for emotion recognition. To validate the proposed framework, two systems are compared: the first classifies emotion from whole utterances (the ordinary method), and the second uses segments of voiced and unvoiced sound (the proposed method). The proposed approach yields higher classification rates than previous systems, both when each emotion feature (linear prediction coding (LPC), Mel-frequency cepstral coefficients (MFCCs), perceptual linear prediction (PLP), and energy) is used individually and when the four features are combined.
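As a rough illustration of the pipeline described above, the following Python sketch makes a frame-level voiced/unvoiced decision from spectral flatness and spectral centroid (the standard librosa feature, used here as a stand-in for the paper's spectral center) and fits one five-mixture GMM per emotion class. It assumes librosa and scikit-learn; the thresholds are hypothetical, since the paper does not state them, and the LPC/MFCC/PLP/energy feature extraction step is omitted.

```python
import numpy as np
import librosa
from sklearn.mixture import GaussianMixture

# Hypothetical thresholds -- the paper does not specify exact values.
FLATNESS_THRESH = 0.3   # voiced frames tend to have low spectral flatness
CENTROID_THRESH = 2000  # Hz; unvoiced (fricative-like) frames skew high

def split_voiced_unvoiced(y, sr, frame_length=1024, hop_length=512):
    """Label each frame voiced/unvoiced from spectral flatness and centroid."""
    flatness = librosa.feature.spectral_flatness(
        y=y, n_fft=frame_length, hop_length=hop_length)[0]
    centroid = librosa.feature.spectral_centroid(
        y=y, sr=sr, n_fft=frame_length, hop_length=hop_length)[0]
    # Treat a frame as voiced when its spectrum is peaky (low flatness)
    # and its energy is concentrated at low frequencies (low centroid).
    return (flatness < FLATNESS_THRESH) & (centroid < CENTROID_THRESH)

def train_emotion_gmms(features_by_emotion):
    """Fit one five-mixture GMM per emotion on frame-level feature matrices."""
    return {
        emotion: GaussianMixture(n_components=5, covariance_type="diag").fit(X)
        for emotion, X in features_by_emotion.items()
    }

def classify(gmms, X):
    """Return the emotion whose GMM gives the highest average log-likelihood."""
    return max(gmms, key=lambda emotion: gmms[emotion].score(X))
```

In the proposed framework, separate models would be trained on the voiced and unvoiced segments selected by the mask above, rather than on whole utterances as in the ordinary method.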