首页> 外文会议>2018 First Asian Conference on Affective Computing and Intelligent Interaction >Long-short term memory for emotional recognition with variable length speech
【24h】

Long-short term memory for emotional recognition with variable length speech

机译:长期和短期记忆可用于变长语音情感识别

获取原文
获取原文并翻译 | 示例

摘要

Despite many kinds of features using for speech emotional recognition task, they are severely restricted due to the same dimension of features extracting from different length of speech. Therefore, frame-level features reserving temporal information in speech waveform are extracted, whose dimension changes dynamically with the length of original speech. From the perspective of information theory, the information loss of frame- Ievel features is less than that of fixed length, and is more suitable for the input of deep learning with self-learning ability. Bidirectional long-short term memory (BiLSTM) is applied to work as a classifier and process the variable length of features. Experimental results demonstrate that the proposed method significantly outperforms the INTERSPEECH 2010 features on CASIA database.
机译:尽管将多种特征用于语音情感识别任务,但是由于从不同长度的语音中提取的特征的相同维度,它们受到严格限制。因此,提取了在语音波形中保留时间信息的帧级特征,其特征随原始语音的长度而动态变化。从信息论的角度来看,框架级特征的信息损失小于固定长度,更适合具有自学习能力的深度学习输入。双向长期短期内存(BiLSTM)被用作分类器并处理要素的可变长度。实验结果表明,该方法明显优于CASIA数据库上的INTERSPEECH 2010功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号