...
首页> 外文期刊>EURASIP journal on applied signal processing >Audio Classification in Speech and Music: A Comparison Between a Statistical and a Neural Approach
【24h】

Audio Classification in Speech and Music: A Comparison Between a Statistical and a Neural Approach

机译:语音和音乐中的音频分类:统计方法和神经方法之间的比较

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

We focus the attention on the problem of audio classification in speech and music for multimedia applications. In particular, we present a comparison between two different techniques for speech/music discrimination. The first method is based on zero crossing rate and Bayesian classification. It is very simple from a computational point of view, and gives good results in case of pure music or speech. The simulation results show that some performance degradation arises when the music segment contains also some speech superimposed on music, or strong rhythmic components. To overcome these problems, we propose a second method, that uses more features, and is based on neural networks (specifically a multi-layer Perceptron). In this case we obtain better performance, at the expense of a limited growth in the computational complexity. In practice, the proposed neural network is simple to be implemented if a suitable polynomial is used as the activation function, and a real-time implementation is possible even if low-cost embedded systems are used.
机译:我们将注意力集中在多媒体应用的语音和音乐中的音频分类问题上。特别是,我们提出了两种不同的语音/音乐歧视技术之间的比较。第一种方法基于零交叉率和贝叶斯分类。从计算的角度来看,它非常简单,并且在纯音乐或语音的情况下也能提供良好的效果。仿真结果表明,当音乐片段中还包含一些叠加在音乐上的语音或强烈的节奏成分时,会导致性能下降。为了克服这些问题,我们提出了第二种方法,该方法使用更多功能,并且基于神经网络(特别是多层感知器)。在这种情况下,我们获得了更好的性能,但以有限的计算复杂度为代价。实际上,如果使用合适的多项式作为激活函数,则所提出的神经网络很容易实现,即使使用低成本嵌入式系统,也可以实时实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号