Improving Valence Prediction in Dimensional Speech Emotion Recognition Using Linguistic Information

机译：使用语言信息改善维度语音情感识别的价预测

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In dimensional emotion recognition, a model called valence, arousal, and dominance is widely used. The current research in dimensional speech emotion recognition has shown a problem that the performance of valence prediction is lower than arousal and dominance. This paper presents an approach to tackle this problem: improving the low score of valence prediction by utilizing linguistic information. Our approach fuses acoustic features with linguistic features, which is a conversion from words to vectors. The results doubled the performance of valence prediction on both single-task learning single-output (predicting valence only) and multitask learning multi-output (predicting valence, arousal, and dominance). Using a proper combination of acoustic and linguistic features not only improved valence prediction, but also improved arousal and dominance predictions in multitask learning.

机译：在尺寸情绪识别中，广泛使用称为价，唤醒和优势的型号。尺寸语音情感识别的目前的研究表明了价值预测的性能低于唤醒和优势的问题。本文提出了一种解决这个问题的方法：通过利用语言信息，提高价值的低分预测。我们的方法融合了语言特征的声学功能，这是从单词到向量的转换。结果一倍增加了对单任务学习单输出（仅限预测价值）和多任务学习多输出（预测价，唤醒和占优势）的价值预测的性能。使用正确的声学和语言特征的适当组合不仅改善了价值预测，而且还改善了多族学习中的唤醒和优势预测。

著录项

来源
《Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques》|2020年|166-171|共6页
会议地点
作者
Bagus Tris Atmaja; Masato Akagi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Acoustics; Linguistics; Emotion recognition; Feature extraction; Speech recognition; Task analysis; Bit error rate;

机译：声学;语言学;情绪识别;特征提取;语音识别;任务分析;误码率;

相似文献

外文文献
中文文献
专利

1. Speech emotion recognition using hybrid spectral-prosodic features of speech signal/glottal waveform, metaheuristic-based dimensionality reduction, and Gaussian elliptical basis function network classifier [J] . Daneshfar Fatemeh, Kabudian Seyed Jahanshah, Neekabadi Abbas Applied Acoustics . 2020,第Sepa期

机译：语音情感识别使用语音信号/光学波形的混合谱 - 韵律特征，基于血管训练的维数减少和高斯椭圆形基函数网络分类器
2. On-line emotion recognition in a 3-D activation-valence-time continuum using acoustic and linguistic cues [J] . Florian Eyben, Martin Woellmer, Alex Graves, Journal on multimodal user interfaces . 2010,第1a2期

机译：使用声音和语言线索在3-D激活价时间连续体内进行在线情感识别
3. Children’s Emotion Recognition from Spontaneous Speech Using a Reduced Set of Acoustic and Linguistic Features [J] . Santiago Planet, Ignasi Iriondo Cognitive Computation . 2013,第4期

机译：使用减少的一组语音和语言功能从自发语音中识别儿童的情绪
4. Emotions in speech - experiments with prosody and quality features in speech for use in categorical and dimensional emotion recognition environments [C] . Borchert, M., Dusterhoft, . 2005

机译：语音中的情感-具有语音韵律和质量特征的实验，用于类别和维度情感识别环境
5. Automatic Speech Recognition Techniques for Diagnostic Predictions of Human Health Disorders [D] . Sadeghian, Roozbeh 2017

机译：自动语音识别技术用于人类健康疾病的诊断预测
6. The role of linguistic and indexical information in improved recognition ofdysarthric speech [O] . Stephanie A. Borrie, a), Megan J. McAuliffe, -1

机译：语言和索引信息在改善对信息的识别中的作用构音障碍
7. On-line Emotion Recognition in a 3-D Activation-Valence-Time Continuum using Acoustic and Linguistic Cues [O] . Eyben, F., Wollmer, M., Graves, A., 2010

机译：使用声音和语言提示的3-D激活价时间连续体中的在线情感识别
8. Prosodic Aids to Speech Recognition: VI. Timing Cues to Linguistic Structure and Improved Computer Programs for Prosodic Analysis. [R] . Lea, W. A., Kloker, D. R. 1975

机译：韵律语音识别助手：VI。韵律分析的计算机程序。

Improving Valence Prediction in Dimensional Speech Emotion Recognition Using Linguistic Information

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅