HOW EFFICIENT IS MPEG-7 FOR GENERAL SOUND RECOGNITION?

机译：MPEG-7有多少量的声音识别？

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Our challenge is to analyze/classify video sound track content for indexing purposes. To this end we compare the performance of MPEG-7 Audio Spectrum Projection (ASP) features based on several basis decomposition algorithms vs. Mel-scale Frequency Cepstrum Coefficients (MFCC). For basis decomposition in the feature extraction we evaluate three approaches: Principal Component Analysis (PCA), Independent Component Analysis (ICA), and Non-negative Matrix Factorization (NMF). Audio features are computed from these reduced vectors and are fed into a continuous hidden Markov model (CHMM) classifier. Our conclusion is that established MFCC features yield better performance compared to MPEG-7 ASP in the general sound recognition under practical constraints.

机译：我们的挑战是分析/分类视频声道内容以获取索引目的。为此，我们基于几个基础分解算法与MEL级频率谱系数（MFCC）进行了基于多个基础分解算法的MPEG-7音频频谱投影（ASP）特征的性能。对于特征提取中的基础分解，我们评估三种方法：主成分分析（PCA），独立分析（ICA）和非负矩阵分解（NMF）。音频功能由这些缩小的矢量计算，并被馈入连续隐藏的马尔可夫模型（CHMM）分类器。我们的结论是，与实际约束下的一般声音识别中的MPEG-7 ASP相比，已建立的MFCC功能会产生更好的性能。

著录项

来源
《International Conference on Metadata for Audio》|2005年||共6页
会议地点
作者
HYOUNG-GOOK KIM; JUAN JOSE BURRED; THOMAS SIKORA;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912.34-53;
关键词

相似文献

外文文献
中文文献
专利

1. MPEG-7 sound-recognition tools [J] . Casey M. IEEE Transactions on Circuits and Systems for Video Technology . 2001,第6期

机译：MPEG-7声音识别工具
2. Robust Environmental Sound Recognition With Sparse Key-Point Encoding and Efficient Multispike Learning [J] . Yu Qiang, Yao Yanli, Wang Longbiao, Neural Networks and Learning Systems, IEEE Transactions on . 2021,第2期

机译：强大的环境声音识别与稀疏关键点编码和高效的多分层学习
3. Energy Efficient Animal Sound Recognition Scheme in Wireless Acoustic Sensors Networks [J] . Saad Al Ahmadi, Badour AlMulhem International Journal of Wireless & Mobile Networks . 2020,第4期

机译：无线声学传感器网络中的节能动物声音识别方案
4. HOW EFFICIENT IS MPEG-7 FOR GENERAL SOUND RECOGNITION? [C] . HYOUNG-GOOK KIM, JUAN JOSE BURRED, THOMAS SIKORA International Conference on Metadata for Audio . 2005

机译：MPEG-7有多少量的声音识别？
5. Biomimetic spike-based algorithms and hardware for sound classification, localization, and speech recognition. [D] . Pu, Yirong. 2011

机译：基于仿生峰值的算法和硬件，用于声音分类，定位和语音识别。
6. What and Where in Auditory Sensory Processing: A High-Density Electrical Mapping Study of Distinct Neural Processes Underlying Sound Object Recognition and Sound Localization [O] . Victoria M. Leavitt, Sophie Molholm, Manuel Gomez-Ramirez, 2011

机译：听觉感觉处理中的什么和何处：声音对象识别和声音定位背后不同神经过程的高密度电映射研究
7. MPEG-7 Sound Recognition Tools [O] . 2008

机译：MPEG-7声音识别工具

HOW EFFICIENT IS MPEG-7 FOR GENERAL SOUND RECOGNITION?

摘要

著录项

相似文献

相关主题

期刊订阅