International Conference on Intelligent Human Computer Interaction

Screening Trauma Through CNN-Based Voice Emotion Classification



Abstract

Recently, many people experience trauma symptoms for various reasons. Trauma causes problems with emotional control and anxiety. Although a psychiatric diagnosis is essential, people are reluctant to visit hospitals. In this paper, we propose a method for screening trauma from voice audio data using convolutional neural networks. Among the six basic emotions, four were used for screening trauma: fear, sadness, happiness, and neutral. In the first pre-processing step, the audio data are cut into 2 s segments and the number of samples is augmented; in the second, each voice segment is converted into a spectrogram image by the short-time Fourier transform. The spectrogram images are used to train four convolutional neural networks. Of these, the VGG-13 model showed the highest trauma-screening accuracy (98.96%). As post-processing, a decision-level fusion strategy determines the final traumatic state by confirming that the traumatic state estimated by the trained VGG-13 model persists over consecutive observations. The results confirm that high-accuracy voice-based trauma screening is possible, depending on the setting value chosen for continuous state observation.
