首页> 外文期刊>Signal Processing, IET >Extraction of adaptive wavelet packet filter-bank-based acoustic feature for speech emotion recognition
【24h】

Extraction of adaptive wavelet packet filter-bank-based acoustic feature for speech emotion recognition

机译:基于自适应小波包滤波器组的声学特征的语音情感识别

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, a wavelet packet (WP)-based acoustic feature extraction approach is proposed for automatic speech emotion recognition (SER). First, the issue of optimising the WP filter-bank structure for giving classification task is presented as a tree pruning problem, and different tree-pruning criteria are investigated. On this basis, a novel WP-based feature is introduced for SER, namely discriminative band WP power coefficients. Finally, a SER system is built and extensive experiments are carried out. Experimental results show that the proposed feature considerably improves emotion recognition performance over conventional mel frequency cepstrum coefficient (MFCC) feature. The proposed feature extraction approach is promising since it can be easily extended to two-dimensional (2D) facial expression analysis with 2D WP quadtree structures, and further a high-quality audio-visual bimodal emotion recognition system is desirable.
机译:本文提出了一种基于小波包(WP)的声学特征提取方法,用于自动语音情感识别(SER)。首先,将优化WP滤波器库结构以提供分类任务的问题作为树修剪问题提出,并研究了不同的树修剪标准。在此基础上,针对SER引入了一种新颖的基于WP的特征,即判别频带WP功率系数。最后,建立了SER系统并进行了广泛的实验。实验结果表明,与传统的mel频率倒谱系数(MFCC)特征相比,该特征大大提高了情感识别性能。所提出的特征提取方法是有希望的,因为它可以容易地扩展到具有2D WP四叉树结构的二维(2D)面部表情分析,并且进一步需要一种高质量的视听双峰情感识别系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号