首页> 外文期刊>Digital Signal Processing >Classification of audio scenes with novel features in a fused system framework
【24h】

Classification of audio scenes with novel features in a fused system framework

机译:融合系统框架中具有新功能的音频场景分类

获取原文
获取原文并翻译 | 示例
           

摘要

The rapidly increasing requirements from context-aware gadgets, like smartphones and intelligent wearable devices, along with applications such as audio archiving, have given a fillip to the research in the field of Acoustic Scene Classification (ASC). The Detection and Classification of Acoustic Scenes and Events (DCASE) challenges have seen systems addressing the problem of ASC from different directions. Some of them could achieve better results than the Mel Frequency Cepstral Coefficients - Gaussian Mixture Model (MFCC-GMM) baseline system. However, a collective decision from all participating systems was found to surpass the accuracy obtained by each system. The simultaneous use of various approaches can exploit the discriminating information in a better way for audio collected from different environments covering audible-frequency range in varying degrees. In this work, we show that the frame level statistics of some well-known spectral features when fed to Support Vector Machine (SVM) classifier individually, are able to outperform the baseline system of DCASE challenges. Furthermore, we analyzed different methods of combining these features, and also of combining information from two channels when the data is in binaural format. The proposed approach resulted in around 17% and 9% relative improvement in accuracy with respect to the baseline system on the development and evaluation dataset, respectively, from DCASE 2016 ASC task. (C) 2018 Elsevier Inc. All rights reserved.
机译:从上下文知识的小工具(如智能手机和智能可穿戴设备)以及音频存档等应用程序的快速增长要求给出了声场分类(ASC)领域的研究。声学场景和事件(DCASE)挑战的检测和分类已经看到了解决来自不同方向的ASC问题的系统。其中一些可以达到比MEL频率谱系数更好的结果 - 高斯混合模型(MFCC-GMM)基线系统。然而,发现所有参与系统的集体决定超过了每个系统获得的准确性。同时使用各种方法可以以更好的方式利用区分信息,以便从覆盖不同程度的不同环境中收集的音频。在这项工作中,我们表明,一定众所周知的频谱特征的帧级统计,当馈送到支持向量机(SVM)分类器时,能够优于DCES挑战的基线系统。此外,我们分析了组合这些特征的不同方法,以及当数据处于双耳格式时与两个通道组合的信息相结合。所提出的方法分别从DCEAC 2016 ASC任务中分别对开发和评估数据集的基线系统的准确性提高约17%和9%。 (c)2018年Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号