...
首页> 外文期刊>Procedia Computer Science >Segment Repetition Based on High Amplitude to Enhance a Speech Emotion Recognition
【24h】

Segment Repetition Based on High Amplitude to Enhance a Speech Emotion Recognition

机译:基于高振幅的片段重复增强语音情感识别

获取原文
           

摘要

Speech Emotion Recognition (SER) is a technology developed on a computer to realize a Human-Computer Interaction (HCI). It is a challenging task since the lack of data. Some data augmentation methods have been created to increase the data variation, but they do not significantly improve accuracy. Therefore, a new additional data augmentation method called Segment Repetition based on High Amplitude (SRHA) is proposed to solve this problem. This method makes some repetitions on the segments that have the highest amplitude. An experiment of 10 times data augmentation, using five standard augmentations and the additional SRHA with a Long Short-Term Memory (LSTM) as the classifier, shows that the proposed SRHA significantly increases the SER accuracy from 95.88% to 98.16%. Other experiments for 20 and 40 times data augmentations also show that the SRHA outperforms the five standard augmentations. These indicate that the SRHA is a powerful data augmentation method for SER.
机译:语音情感识别(SER)是在计算机上开发的一种用于实现人机交互(HCI)的技术。由于缺乏数据,这是一项具有挑战性的任务。已经创建了一些数据增强方法来增加数据变化,但是它们并未显着提高准确性。因此,提出了一种新的附加数据增强方法,称为基于高振幅的分段重复(SRHA),以解决此问题。该方法在幅度最大的段上进行一些重复。使用五种标准扩充以及带有长短期记忆(LSTM)的附加SRHA作为分类器的10倍数据扩充实验表明,提出的SRHA将SER准确度从95.88%显着提高到98.16%。其他针对20和40倍数据增强的实验也表明SRHA优于五个标准增强。这些表明SRHA是SER强大的数据增强方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号