Segment Repetition Based on High Amplitude to Enhance a Speech Emotion Recognition

Bagas Adi Prayitno; Suyanto Suyanto

Abstract

Speech Emotion Recognition (SER) is a computer-based technology for realizing Human-Computer Interaction (HCI). It is a challenging task because of the lack of data. Several data augmentation methods have been created to increase data variation, but they do not significantly improve accuracy. Therefore, a new additional data augmentation method, called Segment Repetition based on High Amplitude (SRHA), is proposed to address this problem. The method repeats the segments that have the highest amplitude. An experiment with 10-times data augmentation, using five standard augmentations plus the additional SRHA and a Long Short-Term Memory (LSTM) network as the classifier, shows that the proposed SRHA significantly increases SER accuracy from 95.88% to 98.16%. Further experiments with 20- and 40-times data augmentation also show that SRHA outperforms the five standard augmentations. These results indicate that SRHA is a powerful data augmentation method for SER.
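
The abstract states only that SRHA repeats the segments with the highest amplitude. The sketch below illustrates that idea under assumptions not taken from the paper: the signal is cut into fixed-length, non-overlapping segments, "amplitude" means the peak absolute sample value, only the single loudest segment is repeated, and the copies are placed directly after the original segment. The function name `repeat_high_amplitude_segment` and its parameters `segment_len` and `repeats` are hypothetical.

```python
from typing import List, Sequence


def repeat_high_amplitude_segment(
    samples: Sequence[float], segment_len: int, repeats: int = 1
) -> List[float]:
    """Return a copy of `samples` with its loudest segment repeated.

    Illustrative only: segment length, amplitude measure, and insertion
    point are assumptions, not details given in the abstract.
    """
    if segment_len <= 0 or segment_len > len(samples):
        raise ValueError("segment_len must be between 1 and len(samples)")
    if repeats < 0:
        raise ValueError("repeats must be non-negative")

    # Only full segments are candidates; a shorter tail is left untouched.
    n_segments = len(samples) // segment_len

    def peak(index: int) -> float:
        # Peak absolute amplitude of the segment at position `index`.
        start = index * segment_len
        return max(abs(x) for x in samples[start:start + segment_len])

    # On ties, max() keeps the earliest segment.
    best = max(range(n_segments), key=peak)
    start = best * segment_len
    end = start + segment_len
    segment = list(samples[start:end])

    # Insert the extra copies immediately after the original segment.
    return list(samples[:end]) + segment * repeats + list(samples[end:])


if __name__ == "__main__":
    toy = [0.1, 0.2, 0.9, -1.0, 0.3, 0.1]
    print(repeat_high_amplitude_segment(toy, segment_len=2, repeats=1))
```

In an augmentation pipeline, a function like this would be applied to each training utterance alongside the standard augmentations, and the lengthened signal would keep the emotion label of the original.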

Bibliographic details

Source: Procedia Computer Science, 2019, issue 22, 7 pages
Authors: Bagas Adi Prayitno; Suyanto Suyanto
Original format: PDF
Chinese Library Classification: computing technology, computer technology
Keywords: data augmentation; high amplitude; long short-term memory; segment repetition; speech emotion recognition

