首页> 外文期刊>Applied Acoustics >Urban sound event classification based on local and global features aggregation
【24h】

Urban sound event classification based on local and global features aggregation

机译:基于局部和全局特征聚合的城市声音事件分类

获取原文
获取原文并翻译 | 示例
       

摘要

The automatic content-based classification of complex and dynamic urban sound is an important aspect of various emerging applications, such as surveillance, urban soundscape understanding and noise source identification, therefore the research topic has gained a lot of attention in recent years. The aim of this paper is to develop efficient machine learning-based scheme for urban sound classification in real-life noise conditions. Unlike conventional sound event classification methods that mainly address local temporal-spectral patterns, we propose an aggregation scheme to combine both local and global acoustic features. For characterizing local patterns, we employ feature learning method to extract class-dependent temporal-spectral structures; on the other hand, long-term descriptive statistics are employed to exploit global features of sound events, e.g. variability and recurrence, which also carry rich discriminant information. In order to aggregate the heterogeneous acoustic information for classification, we introduce mixture of experts model (MoE) which effectively formulates relationship between local and global information. At validation stage, we conduct experiments on UrbanSound8K database which consists of 10 categories of urban sound events with 8732 real-world clips. It is noteworthy that the 10 classes of crowdsourced recordings, including air conditioner, car horn, children playing, dog bark, drilling, engine idling, gunshot, jackhammer, siren and street music, are most common urban sounds closely related to urban life. According to experimental results, the proposed scheme achieved superior performance compared with 3 other latest approaches and it can be a fundamental building block of various urban multimedia information processing systems that help to improve quality of life. (C) 2016 Elsevier Ltd. All rights reserved.
机译:基于内容的自动,复杂,动态的城市声音分类是各种新兴应用的重要方面,例如监视,城市声景理解和噪声源识别,因此,近年来的研究主题受到了广泛关注。本文的目的是为现实的噪声条件下的城市声音分类开发一种有效的基于机器学习的方案。与主要针对局部时谱模式的常规声音事件分类方法不同,我们提出了一种组合局部和全局声学特征的聚合方案。为了表征局部模式,我们采用特征学习方法来提取依赖于类的时域光谱结构。另一方面,采用长期描述性统计数据来开发声音事件的整体特征,例如可变性和复发性,也包含丰富的判别信息。为了聚合异类声学信息以进行分类,我们引入了专家模型(MoE)混合模型,该模型有效地表达了本地信息和全局信息之间的关系。在验证阶段,我们在UrbanSound8K数据库上进行实验,该数据库包含10类城市声音事件以及8732个真实剪辑。值得注意的是,包括空调,汽车喇叭,儿童游戏,狗吠,钻探,发动机空转,枪击,手提钻,警笛和街头音乐在内的10类众包录音是与城市生活密切相关的最常见的城市声音。根据实验结果,与其他3种最新方法相比,该方案具有更好的性能,并且可以成为各种城市多媒体信息处理系统的基本组成部分,有助于改善生活质量。 (C)2016 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号