A NEW APPROACH FOR AUDIO CLASSIFICATION AND SEGMENTATION USING GABOR WAVELETS AND FISHER LINEAR DISCRIMINATOR

RUEI-SHIANG LIN; LING-HWEI CHEN

首页> 外文期刊>International Journal of Pattern Recognition and Artificial Intelligence >A NEW APPROACH FOR AUDIO CLASSIFICATION AND SEGMENTATION USING GABOR WAVELETS AND FISHER LINEAR DISCRIMINATOR

【24h】

A NEW APPROACH FOR AUDIO CLASSIFICATION AND SEGMENTATION USING GABOR WAVELETS AND FISHER LINEAR DISCRIMINATOR

机译：利用Gabor小波和Fisher线性判别器进行音频分类的新方法

获取原文

获取原文并翻译 | 示例

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Rapid increase in the amount of audio data demands an efficient method to automatically segment or classify audio stream based on its content. In this paper, based on the Gabor wavelet features, an audio classification and segmentation method is proposed. This method will first divide an audio stream into clips, each of which contains one-second audio information. Then, each clip is classified as one of two classes or five classes. Two classes contain speech and music; pure speech, pure music, song, speech with music background, and speech with environmental noise background are for five classes. Finally, a merge technique is provided to do segmentation. In order to make the proposed method robust for a variety of audio sources, we use Fisher Linear Discriminator to obtain features with the highest discriminative ability. Experimental results show that the proposed method can achieve over 98% accuracy rate for speech and music discrimination, and more than 95% for a five-way discrimination. By checking the class types of adjacent clips, we can also identify more than 95% audio scene breaks in audio sequence.

机译：音频数据量的快速增长需要一种有效的方法，该方法可以根据音频流的内容自动对音频流进行分段或分类。基于Gabor小波特征，提出了一种音频分类和分割方法。此方法将首先将音频流分成多个片段，每个片段包含一秒钟的音频信息。然后，每个片段被分类为两个类别或五个类别之一。语音和音乐两节课；纯语音，纯音乐，歌曲，具有音乐背景的语音和具有环境噪声背景的语音适用于五个类别。最后，提供了一种合并技术来进行分割。为了使所提出的方法对各种音频源都具有鲁棒性，我们使用Fisher线性鉴别器来获得具有最高鉴别能力的特征。实验结果表明，该方法在语音和音乐识别中可以达到98％以上的准确率，在五向识别中可以达到95％以上。通过检查相邻剪辑的类类型，我们还可以识别音频序列中超过95％的音频场景中断。

著录项

来源
《International Journal of Pattern Recognition and Artificial Intelligence》 |2005年第6期|p.807-822|共16页
作者
RUEI-SHIANG LIN; LING-HWEI CHEN;
展开▼
作者单位

Department of Computer and Information Science, National Chiao Tung University 1001 Ta Hsueh Rd., Hsinchu, Taiwan 30050, R.O.C.;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
audio classification and segmentation; spectrogram; audio content-based retrieval; fisher linear discriminator; gabor wavelets;

机译：音频分类与分割频谱图基于音频内容的检索Fisher线性鉴别器Gabor小波;

相似文献

外文文献
中文文献
专利

1. GNN-CRC: Discriminative Collaborative Representation-Based Classification via Gabor Wavelet Transformation and Nearest Neighbor [J] . ZHANG Yanghao, ZENG Shaoning, ZENG Wei, 上海交通大学学报（英文版） . 2018,第005期
2. Retinal Vessel Segmentation Using Supervised Classification Based on Multi-Scale Vessel Filtering and Gabor Wavelet [J] . Tang Songyuan, Lin Tong, Yang Jian, Journal of Medical Imaging and Health Informatics . 2015,第7期

机译：基于多尺度血管滤波和Gabor小波的监督分类视网膜血管分割
3. Retinal Vessel Segmentation Using the 2-D Gabor Wavelet and Supervised Classification [J] . Soares J.V.B., Leandro J.J.G., Cesar R.M. Jr., IEEE Transactions on Medical Imaging . 2006,第9期

机译：使用二维Gabor小波和监督分类的视网膜血管分割
4. Comparative study of different spatial/spatial-frequency methods (Gabor filters, wavelets, wavelets packets) for texture segmentation/classification [C] . Vautrot, P., Bonnet, Image Processing, 1996. Proceedings., International Conference on . 1996

机译：不同的空间/空间频率方法（Gabor滤波器，小波，小波包）进行纹理分割/分类的比较研究
5. Automated ventricular measurements using Gabor wavelets. [D] . Sampath, Hemalatha. 2007

机译：使用Gabor小波自动进行心室测量。
6. Auto-Weighted Multi-View Discriminative Metric Learning Method With Fisher Discriminative and Global Structure Constraints for Epilepsy EEG Signal Classification [O] . Jing Xue, Xiaoqing Gu, Tongguang Ni 2020

机译：具有Fisher判别和癫痫eEG信号分类的Fisher判别和全局结构约束的自动加权多视图判别度量学习方法
7. A new approach for audio classification and segmentation using Gabor wavelets and Fisher linear discriminator [O] . Ruei-shiang Lin, Ling-hwei Chen 2005

机译：一种利用Gabor小波和Fisher线性鉴别器进行音频分类和分割的新方法

A NEW APPROACH FOR AUDIO CLASSIFICATION AND SEGMENTATION USING GABOR WAVELETS AND FISHER LINEAR DISCRIMINATOR

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅