首页> 外文期刊>Journal of Intelligent Information Systems >Environmental sound recognition using short-time feature aggregation
【24h】

Environmental sound recognition using short-time feature aggregation

机译:使用短时特征聚合的环境声音识别

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Recognition of environmental sound is usually based on two main architectures, depending on whether the model is trained with frame-level features or with aggregated descriptions of acoustic scenes or events. The former architecture is appropriate for applications where target categories are known in advance, while the later affords a less supervised approach. In this paper, we propose a framework for environmental sound recognition based on blind segmentation and feature aggregation. We describe a new set of descriptors, based on Recurrence Quantification Analysis (RQA), which can be extracted from the similarity matrix of a time series of audio descriptors. We analyze their usefulness for recognition of acoustic scenes and events in addition to standard feature aggregation. Our results show the potential of non-linear time series analysis techniques for dealing with environmental sounds.
机译:对环境声音的识别通常基于两个主要体系结构,这取决于模型是使用帧级特征还是对声学场景或事件的综合描述进行训练。前一种体系结构适用于事先知道目标类别的应用,而后一种体系则提供较少监督的方法。在本文中,我们提出了一种基于盲分割和特征聚合的环境声音识别框架。我们基于递归量化分析(RQA)描述了一组新的描述符,可以从音频描述符的时间序列的相似性矩阵中提取该描述符。除了标准特征聚合外,我们还分析了它们在识别声场和事件方面的有用性。我们的结果显示了非线性时间序列分析技术在处理环境声音方面的潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号