首页> 外文会议>International Conference on Parallel Processing Workshops >Large Scale Environmental Sound Classification Based on Efficient Feature Extraction
【24h】

Large Scale Environmental Sound Classification Based on Efficient Feature Extraction

机译:基于有效特征提取的大规模环境声分类

获取原文

摘要

In recent years, plenty of studies endeavor to analyze the life auditory scenarios via mining non-speech sounds. Conventional audio recognition schemes clearly bound the feature extraction and recognition stages, such as in speech recognition. However, such separation leads to inconsistency in the purposes at each stage. The recognition stage contributes to portray the global data distribution focusing on "relationship" between signal samples. However, such consideration can hardly be embedded into feature extraction process which centered on the local structure, thus, the prominent "relation" information is destroyed. In this paper, we propose a unified acoustic recognition framework taking advantage of primitive feature input without injuring discriminant information and adopting effective classification scheme accordingly. We formulate the sound into subspace representation and initially adopt Grassmannian distance to classify the subspace-indexed non-speech sounds. To validate the proposed framework, we conducted experiments using RWCP Sound Scene Database. The experimental results demonstrated the proposed framework achieved fine recognition performance with high efficiency.
机译:近年来,大量研究致力于通过挖掘非语音声音来分析生活听觉场景。常规的音频识别方案清楚地限制了特征提取和识别阶段,例如语音识别。但是,这种分离导致每个阶段的目的不一致。识别阶段有助于描绘集中于信号样本之间“关系”的全局数据分布。但是,这样的考虑几乎不能嵌入到以局部结构为中心的特征提取过程中,从而破坏了突出的“关系”信息。在本文中,我们提出了一个统一的声音识别框架,该框架利用原始特征输入而不会损害判别信息并相应地采用有效的分类方案。我们将声音表达为子空间表示形式,并首先采用Grassmannian距离对子空间索引的非语音声音进行分类。为了验证所提出的框架,我们使用RWCP声音场景数据库进行了实验。实验结果表明,提出的框架具有较高的识别率和较高的识别效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号