Environmental sound recognition using short-time feature aggregation

Roma Gerard; Herrera Perfecto; Nogueira Waldo


Abstract

Recognition of environmental sound is usually based on two main architectures, depending on whether the model is trained with frame-level features or with aggregated descriptions of acoustic scenes or events. The former architecture is appropriate for applications where target categories are known in advance, while the latter affords a less supervised approach. In this paper, we propose a framework for environmental sound recognition based on blind segmentation and feature aggregation. We describe a new set of descriptors, based on Recurrence Quantification Analysis (RQA), which can be extracted from the similarity matrix of a time series of audio descriptors. We analyze their usefulness for recognition of acoustic scenes and events in addition to standard feature aggregation. Our results show the potential of non-linear time series analysis techniques for dealing with environmental sounds.
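As a rough illustration of the two kinds of description mentioned in the abstract, the sketch below aggregates a sequence of frame-level descriptors into summary statistics and computes a few standard RQA measures from a thresholded similarity matrix. The abstract does not say which RQA measures, distance, or threshold the authors use, so the choices here (Euclidean distance, a fixed `radius`, recurrence rate, determinism, and mean diagonal line length) are assumptions, as are all function names.

```python
import numpy as np


def aggregate(frames):
    """Summarise a (n_frames, n_dims) descriptor sequence by per-dimension mean and std."""
    return np.concatenate([frames.mean(axis=0), frames.std(axis=0)])


def recurrence_matrix(frames, radius):
    """Binary recurrence plot: R[i, j] = 1 when frames i and j lie within `radius`."""
    diff = frames[:, None, :] - frames[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))  # pairwise Euclidean distances
    return (dist <= radius).astype(int)


def diagonal_line_lengths(R):
    """Lengths of runs of ones on each diagonal above the main diagonal."""
    lengths = []
    for k in range(1, R.shape[0]):
        run = 0
        for v in np.diagonal(R, offset=k):
            if v:
                run += 1
            else:
                if run:
                    lengths.append(run)
                run = 0
        if run:
            lengths.append(run)
    return lengths


def rqa_features(R, min_len=2):
    """Recurrence rate, determinism, and mean diagonal line length (upper triangle only)."""
    n = R.shape[0]
    n_pairs = n * (n - 1) // 2
    recurrent = int(np.triu(R, k=1).sum())
    lines = [l for l in diagonal_line_lengths(R) if l >= min_len]
    return {
        # Fraction of frame pairs that are recurrent.
        "recurrence_rate": recurrent / n_pairs if n_pairs else 0.0,
        # Fraction of recurrent points lying on diagonal lines of length >= min_len.
        "determinism": sum(lines) / recurrent if recurrent else 0.0,
        # Average length of those diagonal lines.
        "mean_diagonal_length": float(np.mean(lines)) if lines else 0.0,
    }


if __name__ == "__main__":
    # Hypothetical input: 50 frames of 13-dimensional descriptors (e.g. cepstral features).
    rng = np.random.default_rng(0)
    frames = rng.normal(size=(50, 13))
    R = recurrence_matrix(frames, radius=4.5)
    print(aggregate(frames).shape)
    print(rqa_features(R))
```

In a setup like this, the RQA values would be appended to the aggregated statistics of each segment before classification; how the paper actually combines them is not stated in the abstract.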

Bibliographic details

Source
Journal of Intelligent Information Systems, 2018, No. 3, pp. 457-475 (19 pages)
Authors
Roma Gerard; Herrera Perfecto; Nogueira Waldo
Author affiliations

Georgia Inst Technol, Sch Literature Media & Commun, Atlanta, GA 30332 USA;

Univ Pompeu Fabra, Mus Technol Grp, Barcelona, Spain;

Hannover Med Sch, Dept Otolaryngol, Hannover, Germany;

Original format: PDF
Language: English
Keywords
Audio databases; Event detection; Environmental sound recognition; Audio features; Recurrence quantification analysis; Pattern recognition;

Similar literature

1. Environmental sound recognition using short-time feature aggregation [J]. P. Jouvelot. Computing Reviews, 2020, No. 1.
2. Environmental sound recognition using short-time feature aggregation [J]. P. Jouvelot. Computing Reviews, 2019, No. 5.
3. Deep bottleneck features and sound-dependent i-vectors for simultaneous recognition of speech and environmental sounds [C]. Sakriani Sakti, Seiji Kawanishi, Graham Neubig. IEEE Workshop on Spoken Language Technology, 2016.
4. Recognition and characterization of unstructured environmental sounds [D]. Chu, Selina. 2011.
5. Zipf's Law in Short-Time Timbral Codings of Speech, Music, and Environmental Sound Signals [O]. Martín Haro, Joan Serrà, Perfecto Herrera. 2009.
6. Feature Selection Using Genetic Algorithms for the Generation of a Recognition and Classification of Children Activities Model Using Environmental Sound [O]. Antonio García-Dominguez, Carlos E. Galván-Tejada, Laura A. Zanella-Calzada. 2020.
7. Identifying Significant Environmental Features Using Feature Recognition [R]. White, M., Zhu, J., Blandford, B. 2015.
