International Conference on Affective Computing and Intelligent Interaction

Modeling variable length phoneme sequences — A step towards linguistic information for speech emotion recognition in wider world

Abstract

Vocal gestures play an important role in emotion expression and can be used by speech-based emotion recognition systems. This paper proposes the use of BLSTM neural networks to model salient variable-length phoneme sequences, which in turn can represent relevant vocal gestures. Unlike existing techniques, the proposed approach is not restricted to modelling phoneme sequences of a fixed length; both the salience and the optimal modelling length of phoneme sequences are learnt from the training data. Three possible phoneme representations that can be modelled by BLSTMs are compared, and experimental results suggest that sequences of Phone Log Likelihood Ratios are more representative of emotions than sequences of phoneme labels represented as one-hot vectors. On the IEMOCAP database, the proposed approach achieves an Unweighted Average Recall (UAR) of 56.4% on a 4-class classification problem, an absolute improvement of 6.5% over the previous approach of modelling fixed-length phoneme sequences. The proposed linguistic system is complementary to acoustic features, with a fused system leading to an absolute improvement of 5% in UAR.
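
The abstract reports its results as Unweighted Average Recall. As a minimal sketch of how that metric is commonly computed (the mean of per-class recalls, with every class weighted equally regardless of its size), the following uses toy labels that are purely illustrative and are not IEMOCAP data. The function name `unweighted_average_recall` and the class names are assumptions made for this example, not taken from the paper.

```python
from collections import defaultdict


def unweighted_average_recall(y_true, y_pred):
    """Mean of per-class recalls; each class counts equally."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one sample is required")

    total = defaultdict(int)    # samples per true class
    correct = defaultdict(int)  # correctly predicted samples per true class
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1

    recalls = [correct[c] / total[c] for c in total]
    return sum(recalls) / len(recalls)


# Toy 4-class labels (hypothetical, deliberately imbalanced).
y_true = ["angry", "angry", "angry", "angry", "happy", "happy", "neutral", "sad"]
y_pred = ["angry", "angry", "angry", "happy", "happy", "sad", "neutral", "angry"]

uar = unweighted_average_recall(y_true, y_pred)
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(f"UAR = {uar:.4f}, accuracy = {accuracy:.4f}")
```

On imbalanced data the two numbers differ: here the majority class pulls plain accuracy above UAR, which is one reason UAR is often preferred for emotion corpora with uneven class sizes.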
