Robust unsupervised detection of human screams in noisy acoustic environments

机译：在嘈杂的声学环境中对人的尖叫声进行可靠的无监督检测

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This study is focused on an unsupervised approach for detection of human scream vocalizations from continuous recordings in noisy acoustic environments. The proposed detection solution is based on compound segmentation, which employs weighted mean distance, T-statistics and Bayesian Information Criteria for detection of screams. This solution also employs an unsupervised threshold optimized Combo-SAD for removal of non-vocal noisy segments in the preliminary stage. A total of five noisy environments were simulated for noise levels ranging from −20dB to +20dB for five different noisy environments. Performance of proposed system was compared using two alternative acoustic front-end features (i) Mel-frequency cepstral coefficients (MFCC) and (ii) perceptual minimum variance distortionless response (PMVDR). Evaluation results show that the new scream detection solution works well for clean, +20, +10 dB SNR levels, with performance declining as SNR decreases to −20dB across a number of the noise sources considered.

机译：这项研究的重点是从嘈杂的声学环境中连续录制的声音中检测人类尖叫声的无监督方法。所提出的检测解决方案基于复合分割，该复合分割采用加权平均距离，T统计和贝叶斯信息准则来检测尖叫。该解决方案还采用了无监督阈值优化的Combo-SAD，用于在初始阶段去除非语音噪声段。对于五个不同的嘈杂环境，总共模拟了五个嘈杂环境，其噪声水平范围为−20dB到+ 20dB。使用两个可选的声学前端功能（i）梅尔频率倒谱系数（MFCC）和（ii）感知最小方差无失真响应（PMVDR）比较了所提出系统的性能。评估结果表明，新的尖叫检测解决方案适用于干净的+ 20，+ 10 dB SNR水平，并且在考虑的多种噪声源中，当SNR降低至−20dB时，性能会下降。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2015年|161-165|共5页
会议地点
作者
Nandwana Mahesh Kumar; Ziaei Ali; Hansen John H.L.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
CompSeg; PMVDR; T; distance; T; -BIC SAD; scream detection;

机译：CompSeg; PMVDR; T;距离; T; -BIC SAD;尖叫检测;

相似文献

外文文献
中文文献
专利

1. Unsupervised Acoustic Model Adaptation Algorithm Using MLLR in a Noisy Environment [J] . Miichi Yamada, Akira Baba, Shinichi Yoshizawa, Electronics and Communications in Japan. Part 3, Fundamental Electronic Science . 2006,第3期

机译：嘈杂环境中使用MLLR的无监督声学模型自适应算法
2. Unsupervised Speaker Adaptation Based on HMM Sufficient Statistics Using Multiple Acoustic Models Under Noisy Environment [J] . Randy Gomez, Akinobu Lee, Hiroshi Saruwatari, 電子情報通信学会技術研究報告. 音声. Speech . 2004,第542期

机译：噪声环境下基于HMM充分统计的多种声学模型的无监督说话人自适应
3. Unsupervised Speaker Adaptation Based on HMM Sufficient Statistics Using Multiple Acoustic Models Under Noisy Environment [J] . Randy GOMEZ, Akinobu LEE, Hiroshi SARUWATARI, 電子情報通信学会技術研究報告. 音声. Speech . 2004,第542期

机译：噪声环境下基于HMM充分统计的多种声学模型的无监督说话人自适应
4. Robust unsupervised detection of human screams in noisy acoustic environments [C] . Nandwana Mahesh Kumar, Ziaei Ali, Hansen John H.L. IEEE International Conference on Acoustics, Speech and Signal Processing . 2015

机译：在嘈杂的声学环境中强大无监督检测人类尖叫声
5. An unsupervised method for speech detection and segmentation in noisy environments using the parametric trajectory model. [D] . Galligan, Shane. 2006

机译：使用参数轨迹模型在嘈杂环境中进行语音检测和分段的无监督方法。
6. Robustness of Auditory Teager Energy Cepstrum Coefficients for Classification of Pathological and Normal Voices in Noisy Environments [O] . Lotfi Salhi, Adnane Cherif 2013

机译：嘈杂环境中听觉Teager能量倒谱系数的病理和正常声音分类的稳健性
7. Scream and Gunshot Detection in Noisy Environments [O] . Gerosa Luigi, Valenzise Giuseppe, Tagliasacchi Marco, 2007

机译：嘈杂环境中的尖叫声和枪声检测

Robust unsupervised detection of human screams in noisy acoustic environments

摘要

著录项

相似文献

相关主题

期刊订阅