Development and Evaluation of Automatic -Speaker based- Audio Identification and Segmentation for Broadcast News Recordings Indexation

机译：基于自动专用识别和广播新闻录音的音频识别和分割的开发与评估

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this paper, we describe an automatic- speaker based- audio segmentation and identification system for broadcasted news indexation purposes. We specifically focus on speaker identification and audio scene detection. Speaker identification (SI) is based on the state of the art Gaussian mixture models, whereas scene change detection process uses the classical Bayesian Information Criteria (BIC) and the recently proposed DISTBIC algorithm. In this work, the effectiveness of Mel Frequency Cepstral coefficients MFCC, Linear Predictive Cepstral Coefficients LPCC, and Log Area Ratio LAR coefficients are compared for the purpose of text-independent speaker identification and speaker based audio segmentation. Both the Fisher Discrimination Ratio-feature analysis and performance evaluation in terms of correct identification rate on the TIMIT database showed that the LPCC outperforms the other features especially for low order coefficients. Our experiments on audio segmentation module showed that the DISTBIC segmentation technique is more accurate than the BIC procedure especially in the presence of short segments.

机译：在本文中，我们描述了一种基于自动扬声器的基于音频分割和识别系统，用于广播新闻编分的目的。我们专注于扬声器识别和音频场景检测。扬声器识别（SI）基于现有技术的高斯混合模型的状态，而场景变更检测过程使用经典贝叶斯信息标准（BIC）和最近提出的DISTBIC算法。在这项工作中，比较MEL频率谱系齐系数MFCC，线性预测谱系齐数LPCC和对数面积比LER系数的有效性，以便独立于独立于文本的扬声器识别和基于扬声器的音频分割。在Timit数据库上的正确识别率方面，Fisher辨别比率分析和性能评估都表明，LPCC概率尤其是低阶系数的其他功能。我们在音频分割模块上的实验表明，DISTBIC分段技术比BIC程序更准确，尤其是在短段存在下。

著录项

来源
《International Conference on Information and Communication Technologies》|2006年||共6页
会议地点
作者
Messaoud Bengberabi; Abdenour Sehad;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 G20-53;
关键词

相似文献

外文文献
中文文献
专利

1. Development of a Speaker Diarization System for Speaker Tracking in Audio Broadcast News: a Case Study [J] . Mihelic France, Vesnicer Bostjan, Zibert Janez Journal of computing and information technology . 2008,第3期

机译：音频广播新闻中演讲者跟踪的演讲者区分系统的开发：一个案例研究
2. Development Of A Speaker Diarization System For Speaker Tracking In Audio Broadcast News: A Case Study [J] . Janez Zibert, Bostjan Vesnicer, France Mihelic Journal of Computing and Information Technology . 2008,第3期

机译：音频广播新闻中演讲者跟踪的演讲者差异化系统的开发：一个案例研究
3. Automatic multimedia indexing: combining audio, speech, and visual information to index broadcast news [J] . Ohtsuki K., Bessho K., Matsuo Y., IEEE Signal Processing Magazine . 2006,第2期

机译：自动多媒体索引：结合音频，语音和视觉信息以索引广播新闻
4. Development and Evaluation of Automatic -Speaker based- Audio Identification and Segmentation for Broadcast News Recordings Indexation [C] . Messaoud Bengberabi, Abdenour Sehad International Conference on Information and Communication Technologies . 2006

机译：基于自动专用识别和广播新闻录音的音频识别和分割的开发与评估
5. Automatic segmentation, indexing and retrieval of audiovisual data based on combined audio and visual content analysis. [D] . Zhang, Tong. 1999

机译：基于组合的视听内容分析，对视听数据进行自动分段，索引和检索。
6. A device for recording automatic audio tape recording [O] . Martha E. Bernal, Dennis M. Gibson, Donald E. Williams, 1971

机译：用于录制自动录音带的设备
7. AUDIO SOURCE SEGMENTATION USING SPECTRAL CORRELATION FEATURES FOR AUTOMATIC INDEXING OF BROADCAST NEWS [O] . Hayashi Yoshihiko, Ohtsuki Katsutoshi, Mizuno Osamu, 2004

机译：利用谱相关特征对广播新闻进行自动索引的音频源分割

Development and Evaluation of Automatic -Speaker based- Audio Identification and Segmentation for Broadcast News Recordings Indexation

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅