首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Acoustic feature extraction by tensor-based sparse representation for sound effects classification
【24h】

Acoustic feature extraction by tensor-based sparse representation for sound effects classification

机译:基于张量的稀疏表示的声音特征提取用于音效分类

获取原文

摘要

This paper describes a method to extract time-frequency (TF) audio features by tensor-based sparse approximation for sound effects classification. In the proposed method, the observed data is encoded as a higher-order tensor and discriminative features are extracted in spectrotemporal domain. Firstly, audio signals are represented by a joint time-frequency-duration tensor based on sparse approximation; then tensor factorization is applied to calculate feature vectors. The three arrays of the proposed tensor are used to represent frequency, time and duration of transient TF atoms respectively. Experimental results show that exploiting tensor representation allows to characterize distinctive transient TF atoms, yielding an average accuracy improvement of 9.7% and 12.5% compared with matching pursuit (MP) and MFCC features.
机译:本文介绍了一种基于张量的稀疏近似来提取时频(TF)音频特征的方法,用于声音效果分类。在提出的方法中,将观测数据编码为高阶张量,并在光谱时域中提取判别特征。首先,音频信号由基于稀疏近似的联合时频持续时间张量表示;然后将张量分解应用于计算特征向量。拟议的张量的三个阵列分别用于表示瞬态TF原子的频率,时间和持续时间。实验结果表明,利用张量表示法可以表征独特的瞬态TF原子,与匹配追踪(MP)和MFCC特征相比,平均准确性提高了9.7%和12.5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号