首页> 外文会议>International Congress and Exposition on Noise Control Engineering >Objectively Choosing Spectrogram Parameters to Classify Environmental Noises
【24h】

Objectively Choosing Spectrogram Parameters to Classify Environmental Noises

机译:客观地选择谱图参数以分类环境噪音

获取原文

摘要

Spectrograms are commonly used to visualize, analyze, and classify audio signals in the same way that social media companies (e.g., Google, Facebook, Yahoo) use images to classify or tag people in photos. A problem unique to using spectrograms to classify acoustic signals is that the user must choose the spectrogram input-parameters, which may affect the accuracy of the resulting classifier. While the spectrogram - in its simplest form - only has three input-parameters, each parameter has a large number of possible values it can take, resulting in a nearly infinite number of combinations and unique spectrograms. The three input-parameters include the window-type, window-size, and percent-overlap-between-windows. The process of choosing spectrogram parameters, however, is often glossed over in the literature, and there is typically little guidance on how to make this, often, subjective choice. We hypothesize that the choice of spectrogram input-parameters will affect the spectrogram output or features that in turn will affect the performance of the acoustic classifiers. To test this hypothesis, we use Matlab's built-in spectrogram function, a support-vector-machine classifier, a labeled (i.e., human classified) environmental noise dataset, and randomly sample the spectrogram input-parameter space to objectively choose the spectrogram input-parameters. We find that the random sampling procedure is a useful way of choosing the spectrogram input-parameters, and finding the spectrogram features that are the most important for classifying environment noises. The environmental noises used in this study include the noise from air conditioners, car horns, children playing, dogs barking, drilling, engine idling, gunshots, jackhammers, sirens, and street music.
机译:谱图通常用于以相同的方式可视化,分析和分类音频信号(例如,谷歌,Facebook,雅虎)使用图像来分类或标记照片中的人。使用频谱图以分类声信号的问题是用户必须选择频谱图输入参数,这可能会影响所得分类器的准确性。虽然频谱图 - 以最简单的形式 - 只有三个输入参数,但每个参数都有大量可能需要的值,导致几乎无限数量的组合和唯一的谱图。三个输入参数包括窗口类型,窗口大小和窗口之间的百分比 - 窗口。然而,选择频谱图参数的过程通常在文献中掩盖,并且通常对如何制造这一点的指导很少,通常是主观选择。我们假设频谱图输入参数的选择将影响频谱图输出或功能,从而影响声学分类器的性能。为了测试这个假设,我们使用MATLAB的内置频谱图功能,支持矢量机分类器,标记(即人类分类)环境噪声数据集,随机采样频谱图输入参数空间,以客观地选择频谱图 - 参数。我们发现随机采样过程是选择频谱图输入参数的有用方式,并找到对分类环境噪声最重要的频谱图功能。本研究中使用的环境噪音包括来自空调,汽车角,儿童玩耍,狗吠叫,钻孔,发动机怠速,枪声,黑手道,警报和街头音乐的噪音。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号