Audio-based automatic detection of objectionable contents in noisy conditions using normalized segmental two-dimesional MFCC

机译：使用归一化分段二维MFCC在嘈杂条件下基于音频的自动检测不良内容

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The segmental two-dimensional Mel-frequency cepstral coefficient (STDMFCC) feature has been successfully used in recent studies to detect objectionable sounds, which implicitly represent both static and dynamic characteristics of signal. This study now proposes a new normalized STDMFCC to improve the content recognition performance in diverse noisy environments. Two tests were conducted to verify the performance of the proposed feature: First, an objectionable sound recognition test was conducted with 10-second clips to which white noises with diverse signal-to-noise ratios (SNRs) were added. The proposed feature in the test had an average error reduction rate (ERR) of 24.69% with respect to the STDMFCC. Second, a test was conducted based on the soundtrack that contained diverse channel environments and noises. The equal error rate (EER) of the proposed feature was 4.00% compared with 10.33% of STDMFCC, and the ERR was 61.29%.

机译：分段二维梅尔频率倒谱系数（STDMFCC）功能已在最近的研究中成功地用于检测令人反感的声音，这些声音隐含地代表了信号的静态和动态特性。现在，这项研究提出了一种新的归一化STDMFCC，以提高在各种嘈杂环境中的内容识别性能。进行了两项测试以验证所提出功能的性能：首先，使用10秒的剪辑进行了令人反感的声音识别测试，其中添加了具有不同信噪比（SNR）的白噪声。测试中提出的功能相对于STDMFCC具有24.69％的平均错误减少率（ERR）。其次，基于包含不同通道环境和噪声的音轨进行了测试。与STDMFCC的10.33％相比，该功能的均等错误率（EER）为4.00％，而ERR为61.29％。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP》|2012年|p.481- 484|共4页
会议地点 Kyoto(JP)
作者
Kim, Bong-Wan;
展开▼
作者单位

SiTEC Wonkwang Univ. Korea;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Audio-Based Objectionable Content Detection Using Discriminative Transforms of Time-Frequency Dynamics [J] . Kim M. J., Kim H. Multimedia, IEEE Transactions on . 2012,第5期

机译：使用时频动态判别变换的基于音频的有害内容检测
2. Improving short utterance speaker verification by combining MFCC and Entrocy in Noisy conditions [J] . Al-karawi Khamis A., Mohammed Duraid Y. Multimedia Tools and Applications . 2021,第14期

机译：通过在嘈杂的条件下结合MFCC和entocy改进短语扬声器验证
3. Robust Speech Recognition System Using Conventional and Hybrid Features of MFCC, LPCC, PLP, RASTA-PLP and Hidden Markov Model Classifier in Noisy Conditions [J] . Veton Z. K?puska, Hussien A. Elharati Journal of Computer and Communications . 2015,第6期

机译：噪声条件下使用MFCC，LPCC，PLP，RASTA-PLP和隐马尔可夫模型分类器的常规和混合特征的鲁棒语音识别系统
4. Audio-based automatic detection of objectionable contents in noisy conditions using normalized segmental two-dimesional MFCC [C] . Kim Bong-Wan IEEE International Conference on Acoustics, Speech and Signal Processing . 2011

机译：基于音频的自动检测官方化分段二维MFCC噪声条件下的令人反感的内容
5. Exploration of automatic speech recognition for Russian in noisy conditions [D] . Vabishchevich, Pavel. 2015

机译：诺斯条件下俄罗斯自动演讲识别探索
6. Robust EEG-Based Decoding of Auditory Attention With High-RMS-Level Speech Segments in Noisy Conditions [O] . Lei Wang, Ed X. Wu, Fei Chen 2020

机译：基于危险的eeg的eeg的解码在嘈杂的条件下具有高rms级语音段的听觉注意力
7. Analysis of an Automatic Text Content Extraction Approach in Noisy Video Images [O] . C. P. Sumathi, N. Priya 2014

机译：噪声视频图像中自动文本内容提取方法的分析

Audio-based automatic detection of objectionable contents in noisy conditions using normalized segmental two-dimesional MFCC

摘要

著录项

相似文献

相关主题

期刊订阅