Large Scale Environmental Sound Classification Based on Efficient Feature Extraction

机译：基于有效特征提取的大规模环境声分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In recent years, plenty of studies endeavor to analyze the life auditory scenarios via mining non-speech sounds. Conventional audio recognition schemes clearly bound the feature extraction and recognition stages, such as in speech recognition. However, such separation leads to inconsistency in the purposes at each stage. The recognition stage contributes to portray the global data distribution focusing on "relationship" between signal samples. However, such consideration can hardly be embedded into feature extraction process which centered on the local structure, thus, the prominent "relation" information is destroyed. In this paper, we propose a unified acoustic recognition framework taking advantage of primitive feature input without injuring discriminant information and adopting effective classification scheme accordingly. We formulate the sound into subspace representation and initially adopt Grassmannian distance to classify the subspace-indexed non-speech sounds. To validate the proposed framework, we conducted experiments using RWCP Sound Scene Database. The experimental results demonstrated the proposed framework achieved fine recognition performance with high efficiency.

机译：近年来，大量研究致力于通过挖掘非语音声音来分析生活听觉场景。常规的音频识别方案清楚地限制了特征提取和识别阶段，例如语音识别。但是，这种分离导致每个阶段的目的不一致。识别阶段有助于描绘集中于信号样本之间“关系”的全局数据分布。但是，这样的考虑几乎不能嵌入到以局部结构为中心的特征提取过程中，从而破坏了突出的“关系”信息。在本文中，我们提出了一个统一的声音识别框架，该框架利用原始特征输入而不会损害判别信息并相应地采用有效的分类方案。我们将声音表达为子空间表示形式，并首先采用Grassmannian距离对子空间索引的非语音声音进行分类。为了验证所提出的框架，我们使用RWCP声音场景数据库进行了实验。实验结果表明，提出的框架具有较高的识别率和较高的识别效率。

著录项

来源
《International Conference on Parallel Processing Workshops》|2016年|421-425|共5页
会议地点
作者
Xiaoyan Wang; Hao Zhou; Zhi Liu; Yu Gu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Spectrogram; Manifolds; Acoustics; Feature extraction; Hidden Markov models; Measurement; Speech recognition;

机译：频谱图;歧管;声学;特征提取;隐马尔可夫模型;测量;语音识别;

相似文献

外文文献
中文文献
专利

1. Signal feature extraction by multi-scale PCA and its application to respiratory sound classification [J] . XieS., JinF., KrishnanS., Medical and Biological Engineering and Computing: Journal of the International Federation for Medical and Biological Engineering . 2012,第7期

机译：多尺度PCA信号特征提取及其在呼吸声分类中的应用
2. Feature Extraction for Hyperspectral Image Classification Based on Scale Invariant Feature Transform-Locality Preserving Projection Algorithm [J] . Chenming Li, Yan Wang, Hongmin Gao, Journal of computational and theoretical nanoscience . 2015,第12期

机译：基于尺度不变特征变换 - 局部节省投影算法的高光谱图像分类特征提取
3. Segmentation-Based Adaptive Feature Extraction Combined With Mahalanobis Distance Classification Criterion for Heart Sound Diagnostic System [J] . Sun Shuping IEEE sensors journal . 2021,第9期

机译：基于分段的自适应特征提取结合Mahalanobis距离分类标准，用于心脏声音诊断系统
4. Large Scale Environmental Sound Classification Based on Efficient Feature Extraction [C] . Xiaoyan Wang, Hao Zhou, Zhi Liu, International Workshop on Embedded Multicore Systems . 2016

机译：基于高效特征提取的大规模环境声音分类
5. Efficient linear and nonlinear feature extraction and its application to fingerprint classification. [D] . Park, Cheong Hee. 2004

机译：高效的线性和非线性特征提取及其在指纹分类中的应用。
6. paraFaceTest: an ensemble of regression tree-based facial features extraction for efficient facial paralysis classification [O] . Jocelyn Barbosa, Woo-Keun Seo, Jaewoo Kang 2019

机译：paraFaceTest：基于回归树的面部特征提取集合用于有效的面部麻痹分类
7. ACOUSTIC FEATURE EXTRACTION BY STATISTICS BASED LOCAL BINARY PATTERN FOR ENVIRONMENTAL SOUND CLASSIFICATION [O] . Takumi Kobayashi, Jiaxing Ye 2015

机译：基于统计的环境声分类局部二元模式的声学特征提取

Large Scale Environmental Sound Classification Based on Efficient Feature Extraction

摘要

著录项

相似文献

相关主题

期刊订阅