【24h】

HOW EFFICIENT IS MPEG-7 FOR GENERAL SOUND RECOGNITION?

机译:MPEG-7在一般声音识别方面的效率如何?

获取原文
获取原文并翻译 | 示例

摘要

Our challenge is to analyze/classify video sound track content for indexing purposes. To this end we compare the performance of MPEG-7 Audio Spectrum Projection (ASP) features based on several basis decomposition algorithms vs. Mel-scale Frequency Cepstrum Coefficients (MFCC). For basis decomposition in the feature extraction we evaluate three approaches: Principal Component Analysis (PCA), Independent Component Analysis (ICA), and Non-negative Matrix Factorization (NMF). Audio features are computed from these reduced vectors and are fed into a continuous hidden Markov model (CHMM) classifier. Our conclusion is that established MFCC features yield better performance compared to MPEG-7 ASP in the general sound recognition under practical constraints.
机译:我们的挑战是分析/分类视频音轨内容以建立索引。为此,我们比较了基于几种基本分解算法与梅尔级频率倒谱系数(MFCC)的MPEG-7音频频谱投影(ASP)功能的性能。对于特征提取中的基础分解,我们评估三种方法:主成分分析(PCA),独立成分分析(ICA)和非负矩阵分解(NMF)。音频特征是从这些缩减的矢量计算得出的,并被输入到连续的隐马尔可夫模型(CHMM)分类器中。我们的结论是,在实际的约束下,与常规的声音识别相比,已建立的MFCC功能比MPEG-7 ASP具有更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号