Objectively Choosing Spectrogram Parameters to Classify Environmental Noises

机译：客观地选择谱图参数以分类环境噪音

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Spectrograms are commonly used to visualize, analyze, and classify audio signals in the same way that social media companies (e.g., Google, Facebook, Yahoo) use images to classify or tag people in photos. A problem unique to using spectrograms to classify acoustic signals is that the user must choose the spectrogram input-parameters, which may affect the accuracy of the resulting classifier. While the spectrogram - in its simplest form - only has three input-parameters, each parameter has a large number of possible values it can take, resulting in a nearly infinite number of combinations and unique spectrograms. The three input-parameters include the window-type, window-size, and percent-overlap-between-windows. The process of choosing spectrogram parameters, however, is often glossed over in the literature, and there is typically little guidance on how to make this, often, subjective choice. We hypothesize that the choice of spectrogram input-parameters will affect the spectrogram output or features that in turn will affect the performance of the acoustic classifiers. To test this hypothesis, we use Matlab's built-in spectrogram function, a support-vector-machine classifier, a labeled (i.e., human classified) environmental noise dataset, and randomly sample the spectrogram input-parameter space to objectively choose the spectrogram input-parameters. We find that the random sampling procedure is a useful way of choosing the spectrogram input-parameters, and finding the spectrogram features that are the most important for classifying environment noises. The environmental noises used in this study include the noise from air conditioners, car horns, children playing, dogs barking, drilling, engine idling, gunshots, jackhammers, sirens, and street music.

机译：谱图通常用于以相同的方式可视化，分析和分类音频信号（例如，谷歌，Facebook，雅虎）使用图像来分类或标记照片中的人。使用频谱图以分类声信号的问题是用户必须选择频谱图输入参数，这可能会影响所得分类器的准确性。虽然频谱图 - 以最简单的形式 - 只有三个输入参数，但每个参数都有大量可能需要的值，导致几乎无限数量的组合和唯一的谱图。三个输入参数包括窗口类型，窗口大小和窗口之间的百分比 - 窗口。然而，选择频谱图参数的过程通常在文献中掩盖，并且通常对如何制造这一点的指导很少，通常是主观选择。我们假设频谱图输入参数的选择将影响频谱图输出或功能，从而影响声学分类器的性能。为了测试这个假设，我们使用MATLAB的内置频谱图功能，支持矢量机分类器，标记（即人类分类）环境噪声数据集，随机采样频谱图输入参数空间，以客观地选择频谱图 - 参数。我们发现随机采样过程是选择频谱图输入参数的有用方式，并找到对分类环境噪声最重要的频谱图功能。本研究中使用的环境噪音包括来自空调，汽车角，儿童玩耍，狗吠叫，钻孔，发动机怠速，枪声，黑手道，警报和街头音乐的噪音。

著录项

来源
《International Congress and Exposition on Noise Control Engineering》|2016年|794p|共8页
会议地点
作者
Edward T. Nykaza; Anton Netchaev; Steven Bunkley; Matthew G. Blevins;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TB53-53;
关键词
Spectrogram; Classification; Environmental noise;

机译：谱图;分类;环境噪音;
入库时间 2022-08-21 04:38:22

相似文献

外文文献
中文文献
专利

1. Environmental noise classifier using a new set of feature parameters based on pitch range [J] . Buket D. Barkana, Burak Uzkent Applied Acoustics . 2011,第11期

机译：使用基于音高范围的一组新特征参数的环境噪声分类器
2. Repeated double cross-validation for choosing a single solution in evolutionary multi-objective fuzzy classifier design [J] . Hisao Ishibuchi, Yusuke Nojima Knowledge-Based Systems . 2013,第deca期

机译：在进化多目标模糊分类器设计中选择单个解决方案的重复双重交叉验证
3. An Objective Parameter to Classify Voice Signals Based on Variation in Energy Distribution [J] . Liu Boquan, Polce Evan, Jiang Jack Journal of voice: official journal of the Voice Foundation . 2019,第5期

机译：基于能量分布变化对语音信号进行分类的目标参数
4. Objectively Choosing Spectrogram Parameters to Classify Environmental Noises [C] . Edward T. Nykaza, Anton Netchaev, Steven Bunkley, International Congress and Exposition on Noise Control Engineering . 2016

机译：客观地选择谱图参数以分类环境噪音
5. Reducing Uncertainty in Estimates of Environmental Parameters From Ambient Noise Using Statistical Array Processing. [D] . Menon, Ravishankar. 2013

机译：使用统计数组处理减少来自环境噪声的环境参数估计值的不确定性。
6. An objective parameter for quantifying the turbulent noise portion of voice signals [O] . Liyu Lin, William Calawerts, Keith Dodd, -1

机译：用于量化语音信号的湍流噪声部分的客观参数
7. Pareto-optimality and a search for robustness: choosing solutions with desired properties in objective space and parameter space [O] . Gift Dumedah, Aaron A. Berg, Mark Wineberg 2011

机译：帕累托 - 最优性和搜索鲁棒性：选择客观空间和参数空间中具有所需属性的解决方案

Objectively Choosing Spectrogram Parameters to Classify Environmental Noises

摘要

著录项

相似文献

相关主题

期刊订阅