...
首页> 外文期刊>Consumer Electronics, IEEE Transactions on >Feature vector classification based speech emotion recognition for service robots
【24h】

Feature vector classification based speech emotion recognition for service robots

机译:基于特征向量分类的服务机器人语音情感识别

获取原文
获取原文并翻译 | 示例
           

摘要

This paper proposes an efficient feature vector classification for Speech Emotion Recognition (SER) in service robots. Since service robots interact with diverse users who are in various emotional states, two important issues should be addressed: acoustically similar characteristics between emotions and variable speaker characteristics due to different user speaking styles. Each of these issues may cause a substantial amount of overlap between emotion models in feature vector space, thus decreasing SER accuracy. In order to reduce the effects caused by such overlaps, this paper proposes an efficient feature vector classification for SER. The conventional feature vector classification applied to speaker identification categorizes feature vectors as overlapped and non-overlapped. Because this method discards all of the overlapped vectors in model reconstruction, it has limitations in constructing robust models when the number of overlapped vectors is significantly increased such as in emotion recognition. The method proposed herein classifies overlapped vectors in a more sophisticated manner, selecting discriminative vectors among overlapped vectors, and adds those vectors in model reconstruction. On SER experiments using an emotional speech corpus, the proposed classification approach exhibited superior performance to conventional methods, and displayed an almost human-level performance. In particular, we achieved commercially applicable performance for two-class (negative vs. non-negative) emotion recognition.
机译:本文提出了一种服务机器人中语音情感识别(SER)的有效特征向量分类方法。由于服务机器人与处于各种情绪状态的不同用户进行交互,因此应解决两个重要的问题:情绪之间的声学​​上相似的特征以及由于不同用户讲话风格而导致的说话者特征的变化。这些问题中的每一个都可能导致特征向量空间中情感模型之间的大量重叠,从而降低SER准确性。为了减少这种重叠造成的影响,本文提出了一种有效的SER特征向量分类方法。应用于说话者识别的常规特征向量分类将特征向量分类为重叠和不重叠。因为该方法在模型重建中丢弃了所有重叠向量,所以当重叠向量的数量显着增加时(例如在情感识别中),它在构建鲁棒模型方面具有局限性。本文提出的方法以更复杂的方式对重叠向量进行分类,在重叠向量中选择判别向量,并将这些向量添加到模型重建中。在使用情感语音语料库的SER实验中,提出的分类方法表现出优于常规方法的性能,并且显示出几乎与人类水平相同的性能。特别是,我们实现了两类(负与非负)情感识别的商业适用性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号