Conference: Fifth International Conference on Computer and Information Technology (CIT 2005)
Location and Extraction of Broadcast in News Video Based on QGMM and BIC


Abstract

This paper proposes an algorithm for locating and extracting broadcast segments in news video. First, a set of new features that capture the time-varying characteristics of audio segments is extracted, including HZCRR (High Zero-Crossing Rate Ratio), LSTER (Low Short-Time Energy Ratio), and HBFERR (High Basic-Frequency-Energy Rate Ratio), among others; VQ (Vector Quantization) then divides the input audio stream into speech and non-speech segments. Next, a QGMM (Quasi Gaussian Mixture Model) is introduced to describe speaker identity, and BIC (Bayesian Information Criterion) is used to detect speaker changes. Finally, speakers are clustered with BIC, and the broadcast is located and extracted using a set of rules. Experiments gave satisfactory results, supporting the effectiveness of the algorithm.
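The abstract gives no formulas, so the following Python sketch only illustrates two of the named building blocks under stated assumptions. HZCRR and LSTER are computed with thresholds commonly used in the audio-segmentation literature (1.5 times the mean zero-crossing rate and 0.5 times the mean short-time energy); the paper's own thresholds and frame sizes are not stated here. The speaker-change score is the standard ΔBIC test with a single full-covariance Gaussian per segment, which stands in for the paper's QGMM because its definition is not given. All function names and default parameters are hypothetical.

```python
import numpy as np

def frame_signal(x, frame_len, hop):
    """Split a 1-D signal into overlapping frames, one frame per row."""
    n = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n)[:, None]
    return x[idx]

def hzcrr_lster(x, frame_len=400, hop=200):
    """Return (HZCRR, LSTER) for one analysis window of audio samples.

    Assumed definitions (not taken from the paper):
      HZCRR = fraction of frames whose ZCR exceeds 1.5 x the window mean ZCR
      LSTER = fraction of frames whose energy is below 0.5 x the window mean
    """
    frames = frame_signal(np.asarray(x, dtype=float), frame_len, hop)
    nonneg = frames >= 0
    # Zero-crossing rate: share of adjacent sample pairs that change sign.
    zcr = np.mean(nonneg[:, 1:] != nonneg[:, :-1], axis=1)
    # Short-time energy: mean squared amplitude per frame.
    ste = np.mean(frames ** 2, axis=1)
    hzcrr = float(np.mean(zcr > 1.5 * zcr.mean()))
    lster = float(np.mean(ste < 0.5 * ste.mean()))
    return hzcrr, lster

def delta_bic(feats, t, penalty_weight=1.0):
    """Delta-BIC for a candidate speaker change at row t of feats (N x d).

    Each segment is modelled by one full-covariance Gaussian (an assumption;
    the paper uses a QGMM). Positive values favour a change at t.
    """
    n, d = feats.shape

    def logdet_cov(z):
        cov = np.cov(z, rowvar=False, bias=True).reshape(d, d)
        return np.linalg.slogdet(cov)[1]

    left, right = feats[:t], feats[t:]
    gain = 0.5 * (n * logdet_cov(feats)
                  - len(left) * logdet_cov(left)
                  - len(right) * logdet_cov(right))
    # Model-complexity penalty: d mean terms plus d(d+1)/2 covariance terms.
    penalty = 0.5 * (d + 0.5 * d * (d + 1)) * np.log(n)
    return float(gain - penalty_weight * penalty)
```

In this sketch a window that alternates between silence and sound yields a high LSTER, which is why the feature is useful for separating speech from steadier signals. Scanning `delta_bic` over candidate positions and taking positive peaks is one common way to propose speaker-change points.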
