首页> 外文OA文献 >A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces

【2h】

A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces

机译：基于2D特征空间的跨语言语音情感识别研究

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this research, a study of cross-linguistic speech emotion recognition is performed. For this purpose, emotional data of different languages (English, Lithuanian, German, Spanish, Serbian, and Polish) are collected, resulting in a cross-linguistic speech emotion dataset with the size of more than 10.000 emotional utterances. Despite the bi-modal character of the databases gathered, our focus is on the acoustic representation only. The assumption is that the speech audio signal carries sufficient emotional information to detect and retrieve it. Several two-dimensional acoustic feature spaces, such as cochleagrams, spectrograms, mel-cepstrograms, and fractal dimension-based space, are employed as the representations of speech emotional features. A convolutional neural network (CNN) is used as a classifier. The results show the superiority of cochleagrams over other feature spaces utilized. In the CNN-based speaker-independent cross-linguistic speech emotion recognition (SER) experiment, the accuracy of over 90% is achieved, which is close to the monolingual case of SER.

机译：在本研究中，执行了跨语言语音情感识别的研究。为此，收集了不同语言的情感数据（英语，立陶宛语，德语，西班牙语，塞尔维亚和波兰语），导致跨语言语音情绪数据集，其尺寸超过10.000个情绪。尽管收集了数据库的双模形角色，但我们的重点仅在声学表示上。假设是语音音频信号带有足够的情绪信息来检测和检索它。几个二维声学特征空间，例如耳蜗，谱图，Mel-epthrems和基于分形维数的空间，作为语音情绪特征的表示。卷积神经网络（CNN）用作分类器。结果表明，在所使用的其他特征空间上显示了耳蜗仪的优越性。在基于CNN的扬声器无关的交叉语言语言情绪识别（SER）实验中，实现了超过90％的准确性，这是靠近SER的单声道情况。

著录项

作者
Gintautas Tamulevičius; Gražina Korvel; Anil Bora Yayak; Povilas Treigys; Jolita Bernatavičienė; Bożena Kostek;
展开▼
作者单位

展开▼
年度 2020
总页数
原文格式 PDF
正文语种 eng
中图分类

相似文献

外文文献
中文文献
专利

1. Analysis of 2D Feature Spaces for Deep Learning-Based Speech Recognition [J] . GRAZINA KORVEL, POVILAS TREIGYS, GINTAUTAS TAMULEVICIUS, Journal of the Audio Engineering Society . 2018,第12期

机译：基于深度学习的语音识别的2D特征空间分析
2. Feature Selection Based Transfer Subspace Learning for Speech Emotion Recognition [J] . Song Peng, Zheng Wenming Affective Computing, IEEE Transactions on . 2020,第3期

机译：基于特征基于选择的转移子空间学习语言情感识别
3. Speech emotion recognition using hybrid spectral-prosodic features of speech signal/glottal waveform, metaheuristic-based dimensionality reduction, and Gaussian elliptical basis function network classifier [J] . Daneshfar Fatemeh, Kabudian Seyed Jahanshah, Neekabadi Abbas Applied Acoustics . 2020,第Sepa期

机译：语音情感识别使用语音信号/光学波形的混合谱 - 韵律特征，基于血管训练的维数减少和高斯椭圆形基函数网络分类器
4. A Smart Space with Music Selection Feature Based on Face and Speech Emotion and Expression Recognition [C] . Jose Martin Z. Maningo, Argel A. Bandala, Ryan Rhay P. Vicerra, IEEE Region 10 Conference . 2020

机译：基于面部和语音情感和表达识别的音乐选择功能的智能空间
5. Domain Adaptation for Speech Based Emotion Recognition [D] . Abdelwahab, Mohammed. 2019

机译：基于语音情感识别的域适应
6. Particle Swarm Optimization Based Feature Enhancement and Feature Selection for Improved Emotion Recognition in Speech and Glottal Signals [O] . Hariharan Muthusamy, Kemal Polat, Sazali Yaacob -1

机译：基于粒子群优化的特征增强和特征选择用于语音和声门信号中的情感识别
7. Attentive Convolutional Neural Network based Speech Emotion Recognition: A Study on the Impact of Input Features, Signal Length, and Acted Speech [O] . Neumann, Michael, Vu, Ngoc Thang 2017

机译：基于卷积神经网络的语音情感识别：输入特征，信号长度和作用语音的影响研究

A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅