首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP >Audio-based automatic detection of objectionable contents in noisy conditions using normalized segmental two-dimesional MFCC
【24h】

Audio-based automatic detection of objectionable contents in noisy conditions using normalized segmental two-dimesional MFCC

机译:使用归一化分段二维MFCC在嘈杂条件下基于音频的自动检测不良内容

获取原文

摘要

The segmental two-dimensional Mel-frequency cepstral coefficient (STDMFCC) feature has been successfully used in recent studies to detect objectionable sounds, which implicitly represent both static and dynamic characteristics of signal. This study now proposes a new normalized STDMFCC to improve the content recognition performance in diverse noisy environments. Two tests were conducted to verify the performance of the proposed feature: First, an objectionable sound recognition test was conducted with 10-second clips to which white noises with diverse signal-to-noise ratios (SNRs) were added. The proposed feature in the test had an average error reduction rate (ERR) of 24.69% with respect to the STDMFCC. Second, a test was conducted based on the soundtrack that contained diverse channel environments and noises. The equal error rate (EER) of the proposed feature was 4.00% compared with 10.33% of STDMFCC, and the ERR was 61.29%.
机译:分段二维梅尔频率倒谱系数(STDMFCC)功能已在最近的研究中成功地用于检测令人反感的声音,这些声音隐含地代表了信号的静态和动态特性。现在,这项研究提出了一种新的归一化STDMFCC,以提高在各种嘈杂环境中的内容识别性能。进行了两项测试以验证所提出功能的性能:首先,使用10秒的剪辑进行了令人反感的声音识别测试,其中添加了具有不同信噪比(SNR)的白噪声。测试中提出的功能相对于STDMFCC具有24.69%的平均错误减少率(ERR)。其次,基于包含不同通道环境和噪声的音轨进行了测试。与STDMFCC的10.33%相比,该功能的均等错误率(EER)为4.00%,而ERR为61.29%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号