HOW EFFICIENT IS MPEG-7 FOR GENERAL SOUND RECOGNITION?

机译：MPEG-7在一般声音识别方面的效率如何？

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Our challenge is to analyze/classify video sound track content for indexing purposes. To this end we compare the performance of MPEG-7 Audio Spectrum Projection (ASP) features based on several basis decomposition algorithms vs. Mel-scale Frequency Cepstrum Coefficients (MFCC). For basis decomposition in the feature extraction we evaluate three approaches: Principal Component Analysis (PCA), Independent Component Analysis (ICA), and Non-negative Matrix Factorization (NMF). Audio features are computed from these reduced vectors and are fed into a continuous hidden Markov model (CHMM) classifier. Our conclusion is that established MFCC features yield better performance compared to MPEG-7 ASP in the general sound recognition under practical constraints.

机译：我们的挑战是分析/分类视频音轨内容以建立索引。为此，我们比较了基于几种基本分解算法与梅尔级频率倒谱系数（MFCC）的MPEG-7音频频谱投影（ASP）功能的性能。对于特征提取中的基础分解，我们评估三种方法：主成分分析（PCA），独立成分分析（ICA）和非负矩阵分解（NMF）。音频特征是从这些缩减的矢量计算得出的，并被输入到连续的隐马尔可夫模型（CHMM）分类器中。我们的结论是，在实际的约束下，与常规的声音识别相比，已建立的MFCC功能比MPEG-7 ASP具有更好的性能。

著录项

来源
《Audio Engineering Society(AES) International Conference: Metadata for Audio; 20040617-19; London(GB)》|2004年|P.156-161|共6页
会议地点 London(GB)
作者
HYOUNG-GOOK KIM; JUAN JOSE BURRED; THOMAS SIKORA;
展开▼
作者单位

Communication Systems Group, Technical University of Berlin, Germany;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类电声器件;电声技术和语音信号处理;
关键词

相似文献

外文文献
中文文献
专利

1. MPEG-7 sound-recognition tools [J] . Casey M. IEEE Transactions on Circuits and Systems for Video Technology . 2001,第6期

机译：MPEG-7声音识别工具
2. Robust Environmental Sound Recognition With Sparse Key-Point Encoding and Efficient Multispike Learning [J] . Yu Qiang, Yao Yanli, Wang Longbiao, Neural Networks and Learning Systems, IEEE Transactions on . 2021,第2期

机译：强大的环境声音识别与稀疏关键点编码和高效的多分层学习
3. Energy Efficient Animal Sound Recognition Scheme in Wireless Acoustic Sensors Networks [J] . Saad Al Ahmadi, Badour AlMulhem International Journal of Wireless & Mobile Networks . 2020,第4期

机译：无线声学传感器网络中的节能动物声音识别方案
4. HOW EFFICIENT IS MPEG-7 FOR GENERAL SOUND RECOGNITION? [C] . HYOUNG-GOOK KIM, JUAN JOSE BURRED, THOMAS SIKORA International Conference on Metadata for Audio . 2005

机译：MPEG-7有多少量的声音识别？
5. Biomimetic spike-based algorithms and hardware for sound classification, localization, and speech recognition. [D] . Pu, Yirong. 2011

机译：基于仿生峰值的算法和硬件，用于声音分类，定位和语音识别。
6. What and Where in Auditory Sensory Processing: A High-Density Electrical Mapping Study of Distinct Neural Processes Underlying Sound Object Recognition and Sound Localization [O] . Victoria M. Leavitt, Sophie Molholm, Manuel Gomez-Ramirez, 2011

机译：听觉感觉处理中的什么和何处：声音对象识别和声音定位背后不同神经过程的高密度电映射研究
7. MPEG-7 Sound Recognition Tools [O] . 2008

机译：MPEG-7声音识别工具

HOW EFFICIENT IS MPEG-7 FOR GENERAL SOUND RECOGNITION?

摘要

著录项

相似文献

相关主题

期刊订阅