Unsupervised environmental sound recognition

Abstract

Environmental sound recognition is an audio scene identification process that infers a person's location by analyzing background sound. This paper presents a prototype model for environmental sound recognition based on unsupervised learning. Unsupervised learning finds hidden structure in a set of input data and needs no labels indicating the class to which each input belongs, which makes it suitable for practical cases. Sound recognition involves collecting audio data, extracting significant features, and finding common structure among them, which leads to a grouping of the data. Mel-frequency cepstral coefficients (MFCCs) are extracted and then clustered with a Gaussian mixture model, which is a probabilistic model. The clustering leads to identification of the correct audio scene. The implementation uses MATLAB and ModelSim. Five major environmental sounds are considered: car, office, restaurant, street, and subway. The parameters of the Gaussian mixture model are estimated in the training phase, and the model is then tested on inputs using those parameters. The MATLAB implementation shows an efficiency of 98%, and the hardware implementation shows an efficiency of 96.4%.
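
The clustering step described in the abstract can be illustrated with a minimal sketch. It is not the authors' MATLAB or ModelSim implementation, and it makes several assumptions that do not come from the paper: the feature vectors are synthetic Gaussian blobs standing in for MFCC frames (`make_synthetic_features` is a hypothetical helper), the mixture uses diagonal covariances, the initial means are chosen by a simple farthest-point rule, and only two clusters are used instead of five scene classes. The parameters (weights, means, variances) are estimated with expectation-maximization without any labels, and each vector is then assigned to its most probable component.

```python
import math
import random


def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def posteriors(x, weights, means, variances):
    """Posterior probability of each mixture component given x."""
    logs = [
        math.log(w) + log_gauss_diag(x, m, v)
        for w, m, v in zip(weights, means, variances)
    ]
    top = max(logs)  # subtract the max before exponentiating, for stability
    exps = [math.exp(value - top) for value in logs]
    total = sum(exps)
    return [e / total for e in exps]


def farthest_point_init(data, k):
    """Deterministic start (an assumption): spread the initial means apart."""
    means = [list(data[0])]
    while len(means) < k:
        far = max(data, key=lambda x: min(math.dist(x, m) for m in means))
        means.append(list(far))
    return means


def fit_gmm(data, k, n_iter=25, var_floor=1e-3):
    """Estimate weights, means and variances with EM (no labels used)."""
    n, dim = len(data), len(data[0])
    weights = [1.0 / k] * k
    means = farthest_point_init(data, k)
    variances = [[1.0] * dim for _ in range(k)]
    for _ in range(n_iter):
        # E-step: soft assignment of every vector to every component.
        resp = [posteriors(x, weights, means, variances) for x in data]
        # M-step: re-estimate each component from its weighted vectors.
        for j in range(k):
            nj = max(sum(r[j] for r in resp), 1e-12)
            weights[j] = nj / n
            means[j] = [
                sum(r[j] * x[d] for r, x in zip(resp, data)) / nj
                for d in range(dim)
            ]
            variances[j] = [
                max(var_floor,
                    sum(r[j] * (x[d] - means[j][d]) ** 2
                        for r, x in zip(resp, data)) / nj)
                for d in range(dim)
            ]
    return weights, means, variances


def predict(x, model):
    """Index of the most probable component (cluster) for x."""
    post = posteriors(x, *model)
    return post.index(max(post))


def make_synthetic_features(centers, per_cluster=100, seed=0):
    """Hypothetical stand-in for MFCC vectors: Gaussian blobs around centers."""
    rng = random.Random(seed)
    return [
        [rng.gauss(c, 1.0) for c in center]
        for center in centers
        for _ in range(per_cluster)
    ]


# Two well-separated synthetic "scenes" in a 3-dimensional feature space.
features = make_synthetic_features([[0.0, 0.0, 0.0], [10.0, 10.0, 10.0]])
model = fit_gmm(features, k=2)
labels = [predict(x, model) for x in features]
print(labels[:3], labels[-3:])
```

In a real pipeline the synthetic vectors would be replaced by MFCC frames computed from recordings, and `k` would be set to the number of scenes to separate. The cluster indices carry no scene names by themselves, so mapping a cluster to "car" or "office" requires a separate step that the abstract does not detail.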

Bibliographic record

Source: 2014 International Conference on Embedded Systems | 2014 | pp. 44-48 | 5 pages
Conference location: Coimbatore (IN)
Authors: Mohanapriya S.P.; Karthika R.
Author affiliation: Electronics Communication Engineering, Amrita Vishwa Vidyapeetham, Coimbatore-641 112, India
Original format: PDF
Language: English
Keywords: Gaussian mixture model; Mel frequency cepstrum coefficient; unsupervised learning
