首页> 外文期刊>Pattern recognition letters >Speech emotion recognition model based on Bi-GRU and Focal Loss
【24h】

Speech emotion recognition model based on Bi-GRU and Focal Loss

机译:基于Bi-Gru和焦点的语音情感识别模型

获取原文
获取原文并翻译 | 示例
           

摘要

For the problems of inconsistent sample duration and unbalance of sample categories in the speech emotion corpus, this paper proposes a speech emotion recognition model based on Bi-GRU (Bidirection Gated Recurrent Unit) and Focal Loss. The model has been improved on the basis of learning CRNN (Convolutional Recurrent Neural Network) deeply. In CRNN, Bi-GRU is used to effectively lengthen the samples of the speech with short duration, and Focal Loss function is used to deal with the difficulties in classification caused by the imbalance of emotional categories of the samples. Through different methods for experimental comparison, weighted average recall (WAR), unweighted average recall (UAR) and confusion matrix (CM) are used as evaluation index of the algorithm. The experimental results show that the speech emotion recognition model proposed in this paper improves the recognition accuracy and the imbalance of IEMOCAP database samples, and can effectively prove that the improvement of speech emotion recognition performance is not due to the adjustment of model parameters or the change of the model topology. (c) 2020 Elsevier B.V. All rights reserved.
机译:对于语音情绪语料库中的样本持续时间和样品类别不平衡的问题,本文提出了一种基于Bi-Gru(双向门控复发单元)和焦损的语音情感识别模型。该模型在深深学习CRNN(卷积经常性神经网络)的基础上得到了改进。在CRNN中,Bi-Gru用于有效地延长短持续时间的语音的样本,并且使用焦损函数来处理由样品的情绪类别的不平衡引起的分类困难。通过不同的实验比较方法,加权平均召回(战争),未加权的平均召回(UAR)和混淆矩阵(CM)用作算法的评估指标。实验结果表明,本文提出的语音情绪识别模型提高了识别准确性和IEMocap数据库样本的不平衡,可以有效证明语音情绪识别性能的提高不是由于模型参数的调整或变化的调整模型拓扑。 (c)2020 Elsevier B.v.保留所有权利。

著录项

  • 来源
    《Pattern recognition letters》 |2020年第12期|358-365|共8页
  • 作者单位

    Guangdong Univ Foreign Studies South China Business Coll Sch Informat Sci & Technol Guangzhou 510545 Peoples R China|Guangdong Univ Foreign Studies South China Business Coll Inst Intelligent Informat Proc Guangzhou 510545 Peoples R China;

    Guangdong Univ Foreign Studies South China Business Coll Human Resources Dept Guangzhou 510545 Peoples R China;

    Guangdong Univ Foreign Studies South China Business Coll Sch Informat Sci & Technol Guangzhou 510545 Peoples R China;

    Guangdong Univ Foreign Studies South China Business Coll Sch Informat Sci & Technol Guangzhou 510545 Peoples R China|Guangdong Univ Foreign Studies South China Business Coll Inst Intelligent Informat Proc Guangzhou 510545 Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Bi-GRU; Focal loss; Speech emotion recognition; Deep learning; CRNN;

    机译:bi-gru;焦点;语音情感识别;深入学习;CRNN;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号