Two-level fusion-based acoustic scene classification

Waldekar Shefali; Saha Goutam

首页> 外文期刊>Applied Acoustics >Two-level fusion-based acoustic scene classification

【24h】

Two-level fusion-based acoustic scene classification

机译：基于两级融合的声学场景分类

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Growing demands from applications like surveillance, archiving, and context-aware devices have fuelled research towards efficient extraction of useful information from environmental sounds. Assigning a textual label to an audio segment based on the general characteristics of locations or situations is dealt with in acoustic scene classification (ASC). Because of the different nature of audio scenes, a single feature-classifier pair may not efficiently discriminate among environments. Also, the acoustic scenes might vary with the problem under investigation. However, for most of the ASC applications, rather than giving explicit scene labels (like home, park, etc.) a general estimate of the type of surroundings (e.g., indoor or outdoor) might be enough. In this paper, we propose a two-level hierarchical framework for ASC wherein finer labels follow coarse classification. At the first level, texture features extracted from time-frequency representation of the audio samples are used to generate the coarse labels. The system then explores combinations of six well-known spectral features, successfully used in different audio processing fields for second level classification to give finer details of the audio scene. The performance of the proposed system is compared with baseline methods using detection and classification of acoustic scenes and events (DCASE, 2016 and 2017) ASC databases, and found to be superior in terms of classification accuracy. Additionally, the proposed hierarchical method provides important intermediate results as coarse labels that may be useful in certain applications. (C) 2020 Elsevier Ltd. All rights reserved.

机译：从监视，归档和环境知识设备等应用程序的需求增长促进了从环境声音有效提取有用信息的研究。在声学场景分类（ASC）中，将基于位置或情况的一般特征分配给音频段的文本标签。由于音频场景的不同性质，单个特征分类器对可能在环境中有效地区分。此外，声学场景可能因调查问题而异。然而，对于大多数ASC应用，而不是给出明确的场景标签（如家庭，公园等）的一般估计周围的类型（例如，室内或室外）可能就足够了。在本文中，我们为ASC提出了一个双层分层框架，其中更精细的标签遵循粗略分类。在第一级别，从音频样本的时频表示中提取的纹理特征用于生成粗标签。然后，系统探讨了六种众所周知的频谱特征的组合，以用于第二级分类的不同音频处理字段中，以提供更精细的音频场景细节。使用声学场景和事件的检测和分类（DCASE，2016和2017）ASC数据库的检测和分类，将所提出的系统的性能与基线方法进行比较，并发现在分类准确性方面是优越的。另外，所提出的分层方法将重要的中间结果提供作为在某些应用中可能有用的粗标记。（c）2020 elestvier有限公司保留所有权利。

著录项

来源
《Applied Acoustics》 |2020年第12期|107502.1-107502.11|共11页
作者
Waldekar Shefali; Saha Goutam;
展开▼
作者单位

IIT Kharagpur Dept Elect & Elect Commun Engn Kharagpur W Bengal India;

IIT Kharagpur Dept Elect & Elect Commun Engn Kharagpur W Bengal India;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Environmental acoustics; Hierarchical classification; Score-fusion; Spectral features; Texture features;

机译：环境声学;分层分类;得分融合;光谱功能;纹理特征;

相似文献

外文文献
中文文献
专利

1. Two-Level Feature Representation for Aerial Scene Classification [J] . Jinrui Gan, Qingyong Li, Zhen Zhang, IEEE Geoscience and Remote Sensing Letters . 2016,第11期

机译：空中场景分类的两级特征表示
2. Investigation of acoustic and visual features for acoustic scene classification [J] . Xie Jie, Zhu Mingying Expert Systems with Application . 2019,第JULa期

机译：声学和视觉特征的声学场景分类研究
3. Semisupervised Two-Level Fusion-Based Autoencoded Approach for Low-Cost Domain Adaptation of Remotely Sensed Images [J] . Chakraborty Shounak, Roy Moumita, Melganie Farid IEEE Geoscience and Remote Sensing Letters . 2019,第7期

机译：基于半监督两级融合的自动编码方法用于遥感图像的低成本域自适应
4. High-Resolution Attention Network with Acoustic Segment Model for Acoustic Scene Classification [C] . Xue Bai, Jun Du, Jia Pan, IEEE International Conference on Acoustics, Speech and Signal Processing . 2020

机译：带有声音片段模型的高分辨率注意力网络在声音场景分类中的应用
5. Resynthesis of Urban Acoustic Scenes Based on Acoustic Event Detection. [D] . You, Jaeseong. 2017

机译：基于声事件检测的城市声场景再合成。
6. Multi-View Fusion-Based 3D Object Detection for Robot Indoor Scene Perception [O] . Li Wang, Ruifeng Li, Jingwen Sun, 2019

机译：基于多视图融合的3D目标检测技术用于机器人室内场景感知
7. Acoustic scene classification based on generative model of acoustic spatial words for distributed microphone array [O] . Keisuke Imoto, Nobutaka Ono 2017

机译：基于分布式麦克风阵列声空间单词生成模型的声学场景分类

Two-level fusion-based acoustic scene classification

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅