首页> 外文期刊>International Journal of Pattern Recognition and Artificial Intelligence >A NEW APPROACH FOR AUDIO CLASSIFICATION AND SEGMENTATION USING GABOR WAVELETS AND FISHER LINEAR DISCRIMINATOR
【24h】

A NEW APPROACH FOR AUDIO CLASSIFICATION AND SEGMENTATION USING GABOR WAVELETS AND FISHER LINEAR DISCRIMINATOR

机译:利用Gabor小波和Fisher线性判别器进行音频分类的新方法

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Rapid increase in the amount of audio data demands an efficient method to automatically segment or classify audio stream based on its content. In this paper, based on the Gabor wavelet features, an audio classification and segmentation method is proposed. This method will first divide an audio stream into clips, each of which contains one-second audio information. Then, each clip is classified as one of two classes or five classes. Two classes contain speech and music; pure speech, pure music, song, speech with music background, and speech with environmental noise background are for five classes. Finally, a merge technique is provided to do segmentation. In order to make the proposed method robust for a variety of audio sources, we use Fisher Linear Discriminator to obtain features with the highest discriminative ability. Experimental results show that the proposed method can achieve over 98% accuracy rate for speech and music discrimination, and more than 95% for a five-way discrimination. By checking the class types of adjacent clips, we can also identify more than 95% audio scene breaks in audio sequence.
机译:音频数据量的快速增长需要一种有效的方法,该方法可以根据音频流的内容自动对音频流进行分段或分类。基于Gabor小波特征,提出了一种音频分类和分割方法。此方法将首先将音频流分成多个片段,每个片段包含一秒钟的音频信息。然后,每个片段被分类为两个类别或五个类别之一。语音和音乐两节课;纯语音,纯音乐,歌曲,具有音乐背景的语音和具有环境噪声背景的语音适用于五个类别。最后,提供了一种合并技术来进行分割。为了使所提出的方法对各种音频源都具有鲁棒性,我们使用Fisher线性鉴别器来获得具有最高鉴别能力的特征。实验结果表明,该方法在语音和音乐识别中可以达到98%以上的准确率,在五向识别中可以达到95%以上。通过检查相邻剪辑的类类型,我们还可以识别音频序列中超过95%的音频场景中断。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号