首页> 外文会议>Annual conference of the International Speech Communication Association >Using Blob Detection in Missing Feature Linear-Frequency Cepstral Coefficients for Robust Sound Event Recognition
【24h】

Using Blob Detection in Missing Feature Linear-Frequency Cepstral Coefficients for Robust Sound Event Recognition

机译:在缺少的特征线性频率倒谱系数中使用斑点检测进行稳健的声音事件识别

获取原文

摘要

The proposed Missing Feature Linear-Frequency Cepstral Coefficients (MF-LFCC) is a noise robust cepstral feature that transforms both clean and noisy signals into a similar representation. Unlike conventional Missing Feature Techniques, the MF-LFCC does not require the substitution of spectrogram elements (imputation) or classifier modification (marginalization). To improve the noise mask used in the MF-LFCC, we propose to use the computer vision technique of blob detection to identify the peaks characterizing the sparsity of sound event spectrograms. For single sound event recognition using SVM classifiers, the MF-LFCC is shown to significantly outperform the MFCC baseline and the noise robust ESTI Advanced Front End feature in noisy conditions.
机译:拟议的缺失特征线性倒谱系数(MF-LFCC)是一种噪声稳健的倒谱特征,可将干净的信号和有噪声的信号都转换为相似的表示形式。与传统的缺失特征技术不同,MF-LFCC不需要替换频谱图元素(输入)或分类器修改(边际化)。为了改善MF-LFCC中使用的噪声掩膜,我们建议使用斑点检测的计算机视觉技术来识别表征声音事件频谱图稀疏性的峰值。对于使用SVM分类器的单个声音事件识别,在嘈杂的条件下,MF-LFCC的性能明显优于MFCC基线和抗噪能力强的ESTI高级前端功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号