Audio Classification in Speech and Music: A Comparison Between a Statistical and a Neural Approach

Alessandro Bugatti; Alessandra Flammini; Pierangelo Migliorati

首页> 外文期刊>EURASIP journal on applied signal processing >Audio Classification in Speech and Music: A Comparison Between a Statistical and a Neural Approach

【24h】

Audio Classification in Speech and Music: A Comparison Between a Statistical and a Neural Approach

机译：语音和音乐中的音频分类：统计方法和神经方法之间的比较

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We focus the attention on the problem of audio classification in speech and music for multimedia applications. In particular, we present a comparison between two different techniques for speech/music discrimination. The first method is based on zero crossing rate and Bayesian classification. It is very simple from a computational point of view, and gives good results in case of pure music or speech. The simulation results show that some performance degradation arises when the music segment contains also some speech superimposed on music, or strong rhythmic components. To overcome these problems, we propose a second method, that uses more features, and is based on neural networks (specifically a multi-layer Perceptron). In this case we obtain better performance, at the expense of a limited growth in the computational complexity. In practice, the proposed neural network is simple to be implemented if a suitable polynomial is used as the activation function, and a real-time implementation is possible even if low-cost embedded systems are used.

机译：我们将注意力集中在多媒体应用的语音和音乐中的音频分类问题上。特别是，我们提出了两种不同的语音/音乐歧视技术之间的比较。第一种方法基于零交叉率和贝叶斯分类。从计算的角度来看，它非常简单，并且在纯音乐或语音的情况下也能提供良好的效果。仿真结果表明，当音乐片段中还包含一些叠加在音乐上的语音或强烈的节奏成分时，会导致性能下降。为了克服这些问题，我们提出了第二种方法，该方法使用更多功能，并且基于神经网络（特别是多层感知器）。在这种情况下，我们获得了更好的性能，但以有限的计算复杂度为代价。实际上，如果使用合适的多项式作为激活函数，则所提出的神经网络很容易实现，即使使用低成本嵌入式系统，也可以实时实现。

著录项

来源
《EURASIP journal on applied signal processing》 |2002年第4期|共7页
作者
Alessandro Bugatti; Alessandra Flammini; Pierangelo Migliorati;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类通信;
关键词
speech/music discrimination; indexing of audio-visual documents; neural networks; multimedia applications;

机译：语音/音乐歧视;视听文件索引;神经网络;多媒体应用;

相似文献

外文文献
中文文献
专利

1. Audio Classification in Speech and Music: A Comparison Between a Statistical and a Neural Approach [J] . Alessandro Bugatti, Alessandra Flammini, Pierangelo Migliorati EURASIP journal on applied signal processing . 2002,第4期

机译：语音和音乐中的音频分类：统计方法和神经方法之间的比较
2. Audio Classification in Speech and Music: A Comparison between a Statistical and a Neural Approach [J] . Alessandro Bugatti, Alessandra Flammini, Pierangelo Migliorati EURASIP journal on advances in signal processing . 2002,第4期

机译：语音和音乐中的音频分类：统计方法和神经方法之间的比较
3. AUDIO CLASSIFICATION OF MUSIC/SPEECH MIXED SIGNALS USING SINUSOIDAL MODELING WITH SVM AND NEURAL NETWORK APPROACH [J] . PEJMAN MOWLAEE, ABOLGHASEM SAYADIYAN Journal of Circuits, Systems, and Computers . 2013,第2期

机译：使用SVM和神经网络方法的正弦建模对音乐/语音混合信号进行音频分类
4. Classification of audios containing speech and music [C] . Uzun Erkam, Sencar Husrev Taha . 2012

机译：包含语音和音乐的音频分类
5. Comparison of classification techniques for speech/audio applications. [D] . Shao, Ying. 2003

机译：语音/音频应用分类技术的比较。
6. Tower of London Test: A Comparison between Conventional Statistic Approach and Modelling Based on Artificial Neural Network in Differentiating Fronto-Temporal Dementia from Alzheimer’s Disease [O] . Massimo Franceschi, Paolo Caffarra, Rita Savarè, 2011

机译：伦敦塔测试：传统统计学方法与基于人工神经网络的模型在额颞痴呆症与阿尔茨海默氏病鉴别中的比较
7. Audio Classification in Speech and Music: A Comparison between a Statistical and a Neural Approach [O] . Bugatti Alessandro, Flammini Alessandra, Migliorati Pierangelo 2002

机译：语音和音乐中的音频分类：统计方法和神经方法之间的比较

Audio Classification in Speech and Music: A Comparison Between a Statistical and a Neural Approach

摘要

著录项

相似文献

相关主题

期刊订阅