首页> 外文会议>INTERSPEECH 2012 >Combining frame and segment based models for environmental sound classification

【24h】

Combining frame and segment based models for environmental sound classification

机译：基于帧和段的环境声分类模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The paper considers the task of recognizing environmental sounds, which plays a critical role in human's perception of an auditory context in audiovisual materials. A variety of features have been proposed for audio recognition, either frame-based or segmental. Here, we propose a two-stage framework to combine modeling in these two levels. First, the Gaussian Mixture Models(GMMs) are built based on short-term features and preclassification are performed. Then, in the event that the GMMs are not certain about the result, the system engages Support Vector Machines (SVMs) to refine the output hypothesis. In the next stage, the features are combined by taking posterior estimates of GMMs along with segmental features as SVMs' input features. Experiments on the sound dataset show that the proposed framework makes an improvement over the traditional methods.

机译：本文考虑了承认环境声音的任务，这在人类对视听环境中的感知中发挥着关键作用。已经提出了各种特征来进行音频识别，无论是基于帧的还是分段。在这里，我们提出了一个两级框架，以在这两个层面中结合建模。首先，高斯混合模型（GMMS）是基于短期特征的构建，并进行预分散。然后，在GMMS不确定结果的情况下，系统接合支持向量机（SVM）以优化输出假设。在下一个阶段，通过将GMM的后验估计以及SVMS输入特征的分段特征以及SVMS的输入特征来组合该特征。声音数据集的实验表明，所提出的框架通过传统方法改进。

著录项

来源
《INTERSPEECH 2012》|2012年||共4页
会议地点
作者
Pengfei Hu; Wenju Liu; Wei Jiang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 73.4136083;
关键词
environmental sound classification; model combination; GMMs; SVMs;

机译：环境声音分类;模型组合;GMMS;SVMS;
入库时间 2022-08-20 22:09:19

相似文献

外文文献
中文文献
专利

1. A Frame Work for Classification of Multi Class Medical Data based on Deep Learning and Naive Bayes Classification Model [J] . N. Ramesh, G. Lavanya Devi, K Srinivasa Rao International Journal of Information Engineering and Electronic Business . 2020,第1期

机译：基于深度学习和天真贝叶斯分类模型的多类医疗数据分类的框架工作
2. Benchmarks for microstructure-based modelling of sound absorbing rigid-frame porous media [J] . Zielinski Tomasz G., Venegas Rodolfo, Perrot Camille, Journal of Sound and Vibration . 2020,第1期

机译：声音吸收刚性框架多孔介质微观结构基础建模的基准
3. Unit Selection Speech Synthesis Using Frame-Sized Speech Segments and Neural Network Based Acoustic Models [J] . Zhen-Hua Ling, Zhi-Ping Zhou Journal of VLSI signal processing systems for signal, image, and video technology . 2018,第7期

机译：基于帧大小的语音片段和基于神经网络的声学模型的单位选择语音合成
4. Combining frame and segment based models for environmental sound classification [C] . Pengfei Hu, Wenju Liu, Wei Jiang Annual conference of the International Speech Communication Association . 2012

机译：结合基于框架和片段的模型进行环境声音分类
5. On the use of frame and segment-based methods for the detection and classification of speech sounds and features [D] . Hou, Jun 2009

机译：关于使用基于帧和片段的方法对语音和特征进行检测和分类
6. Combining Ramachandran plot and molecular dynamics simulation for structural-based variant classification: Using TP53 variants as model [O] . Benjamin Tam, Siddharth Sinha, San Ming Wang 2020

机译：组合Ramachandran Plot和分子动力学模拟对基于结构的变体分类：使用TP53变体作为模型
7. ESResNet: Environmental Sound Classification Based on Visual Domain Models [O] . Andrey Guzhov, Federico Raue, Jorn Hees, 2021

机译：ESRESNet：基于Visual Domain Models的环境声音分类
8. Sound Classification and Localization Based on Biology Hearing Models and Multiscale Vector Quantization [R] . Baras, J. S. 1999

机译：基于生物听力模型和多尺度矢量量化的声音分类与定位

Combining frame and segment based models for environmental sound classification

摘要

著录项

相似文献

相关主题

期刊订阅