【24h】

Improving Valence Prediction in Dimensional Speech Emotion Recognition Using Linguistic Information

机译:使用语言信息改善维度语音情感识别的价预测

获取原文
获取外文期刊封面目录资料

摘要

In dimensional emotion recognition, a model called valence, arousal, and dominance is widely used. The current research in dimensional speech emotion recognition has shown a problem that the performance of valence prediction is lower than arousal and dominance. This paper presents an approach to tackle this problem: improving the low score of valence prediction by utilizing linguistic information. Our approach fuses acoustic features with linguistic features, which is a conversion from words to vectors. The results doubled the performance of valence prediction on both single-task learning single-output (predicting valence only) and multitask learning multi-output (predicting valence, arousal, and dominance). Using a proper combination of acoustic and linguistic features not only improved valence prediction, but also improved arousal and dominance predictions in multitask learning.
机译:在尺寸情绪识别中,广泛使用称为价,唤醒和优势的型号。尺寸语音情感识别的目前的研究表明了价值预测的性能低于唤醒和优势的问题。本文提出了一种解决这个问题的方法:通过利用语言信息,提高价值的低分预测。我们的方法融合了语言特征的声学功能,这是从单词到向量的转换。结果一倍增加了对单任务学习单输出(仅限预测价值)和多任务学习多输出(预测价,唤醒和占优势)的价值预测的性能。使用正确的声学和语言特征的适当组合不仅改善了价值预测,而且还改善了多族学习中的唤醒和优势预测。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号