Urban sound event classification based on local and global features aggregation

Ye Jiaxing; Kobayashi Takumi; Murakawa Masahiro

首页> 外文期刊>Applied Acoustics >Urban sound event classification based on local and global features aggregation

【24h】

Urban sound event classification based on local and global features aggregation

机译：基于局部和全局特征聚合的城市声音事件分类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The automatic content-based classification of complex and dynamic urban sound is an important aspect of various emerging applications, such as surveillance, urban soundscape understanding and noise source identification, therefore the research topic has gained a lot of attention in recent years. The aim of this paper is to develop efficient machine learning-based scheme for urban sound classification in real-life noise conditions. Unlike conventional sound event classification methods that mainly address local temporal-spectral patterns, we propose an aggregation scheme to combine both local and global acoustic features. For characterizing local patterns, we employ feature learning method to extract class-dependent temporal-spectral structures; on the other hand, long-term descriptive statistics are employed to exploit global features of sound events, e.g. variability and recurrence, which also carry rich discriminant information. In order to aggregate the heterogeneous acoustic information for classification, we introduce mixture of experts model (MoE) which effectively formulates relationship between local and global information. At validation stage, we conduct experiments on UrbanSound8K database which consists of 10 categories of urban sound events with 8732 real-world clips. It is noteworthy that the 10 classes of crowdsourced recordings, including air conditioner, car horn, children playing, dog bark, drilling, engine idling, gunshot, jackhammer, siren and street music, are most common urban sounds closely related to urban life. According to experimental results, the proposed scheme achieved superior performance compared with 3 other latest approaches and it can be a fundamental building block of various urban multimedia information processing systems that help to improve quality of life. (C) 2016 Elsevier Ltd. All rights reserved.

机译：基于内容的自动，复杂，动态的城市声音分类是各种新兴应用的重要方面，例如监视，城市声景理解和噪声源识别，因此，近年来的研究主题受到了广泛关注。本文的目的是为现实的噪声条件下的城市声音分类开发一种有效的基于机器学习的方案。与主要针对局部时谱模式的常规声音事件分类方法不同，我们提出了一种组合局部和全局声学特征的聚合方案。为了表征局部模式，我们采用特征学习方法来提取依赖于类的时域光谱结构。另一方面，采用长期描述性统计数据来开发声音事件的整体特征，例如可变性和复发性，也包含丰富的判别信息。为了聚合异类声学信息以进行分类，我们引入了专家模型（MoE）混合模型，该模型有效地表达了本地信息和全局信息之间的关系。在验证阶段，我们在UrbanSound8K数据库上进行实验，该数据库包含10类城市声音事件以及8732个真实剪辑。值得注意的是，包括空调，汽车喇叭，儿童游戏，狗吠，钻探，发动机空转，枪击，手提钻，警笛和街头音乐在内的10类众包录音是与城市生活密切相关的最常见的城市声音。根据实验结果，与其他3种最新方法相比，该方案具有更好的性能，并且可以成为各种城市多媒体信息处理系统的基本组成部分，有助于改善生活质量。（C）2016 Elsevier Ltd.保留所有权利。

著录项

来源
《Applied Acoustics》 |2017年第ptab期|246-256|共11页
作者
Ye Jiaxing; Kobayashi Takumi; Murakawa Masahiro;
展开▼
作者单位

Natl Inst Adv Ind Sci & Technol, Tsukuba, Ibaraki, Japan;

Natl Inst Adv Ind Sci & Technol, Tsukuba, Ibaraki, Japan;

Natl Inst Adv Ind Sci & Technol, Tsukuba, Ibaraki, Japan;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Urban sound classification; Crowd-sourcing; Signal processing; Machine learning; Dictionary learning; Feature aggregation;

机译：城市声音分类;众包;信号处理;机器学习;词典学习;特征集合;

相似文献

外文文献
中文文献
专利

1. Urban sound classification based on 2-order dense convolutional network using dual features [J] . Applied Acoustics . 2020,第Jula期

机译：基于双重特征的二阶密集卷积网络的城市声音分类
2. An automated snoring sound classification method based on local dual octal pattern and iterative hybrid feature selector [J] . Tuncer Turker, Akbal Erhan, Dogan Sengul Biomedical signal processing and control . 2021,第Jana期

机译：一种基于局部双八大模式和迭代混合特征选择器的自动打鼾声音分类方法
3. Global-feature classification can be acquired more rapidly than local-feature classification in both humans and pigeons [J] . Goto K, Wills AJ, Lea SEG Animal Cognition . 2004,第2期

机译：在人类和鸽子中，全局特征分类都可以比局部特征分类更快地获取
4. Sound event classification based on Feature Integration, Recursive Feature Elimination and Structured Classification [C] . Tran, H.D., Haizhou Li IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP 2009 . 2009

机译：基于特征集成，递归特征消除和结构化分类的声音事件分类
5. Exploring image and video by classification and clustering on global and local visual features. [D] . Lu, Le. 2007

机译：通过对全局和局部视觉特征进行分类和聚类来探索图像和视频。
6. Multi-Feature Classification of Multi-Sensor Satellite Imagery Based on Dual-Polarimetric Sentinel-1A Landsat-8 OLI and Hyperion Images for Urban Land-Cover Classification [O] . Tao Zhou, Zhaofu Li, Jianjun Pan 2018

机译：基于双极化Sentinel-1ALandsat-8 OLI和Hyperion图像的多传感器卫星图像的多特征分类用于城市土地覆盖分类
7. Recognition of Urban Sound Events using Deep Context-Aware Feature Extractors and Handcrafted Features [O] . Theodoros Giannakopoulos, Stavros Perantonis 2018

机译：使用深层上下文感知功能提取器和手动功能识别城市声音事件

Urban sound event classification based on local and global features aggregation

摘要

著录项

相似文献

相关主题

期刊订阅