首页> 外文会议>International Conference on Engineering MIS >A system for speaker detection and tracking in audio broadcast news
【24h】

A system for speaker detection and tracking in audio broadcast news

机译:一种用于扬声器检测和跟踪音频广播新闻的系统

获取原文
获取外文期刊封面目录资料

摘要

A system for speaker-based audio indexing and for speaker tracking in broadcast news audio is presented. Several tasks which are treated as a multistage process construct the process of producing indexing information in continuous audio streams based on detected speakers. The main constructing blocks of such an indexing system contain components for an audio segmentation, speaker detection, speaker clustering, and speaker identification. In the proposed speaker-based audio indexing system, three probabilistic Linear Disciminant Analysis (PLDA) variants-standard, simplified and two-covariance-, and Gaussian Mixture Model (GMM) are proposed in the speaker identification stage. The evaluation is performed on audio data from the broadcast news domain and the obtained results demonstrate the superiority of two-covariance PLDA model in terms of performance results compared to other proposed algorithms.
机译:提出了一种用于基于说话者的音频索引和用于广播新闻音频中的说话者跟踪的系统。被视为多阶段过程的几个任务构成了基于检测到的说话者在连续音频流中生成索引信息的过程。这样的索引系统的主要构造块包含用于音频分割,说话者检测,说话者聚类和说话者识别的组件。在提出的基于说话者的音频索引系统中,在说话者识别阶段提出了三种概率线性判别分析(PLDA)变体-标准,简化和二次协方差以及高斯混合模型(GMM)。对来自广播新闻领域的音频数据进行了评估,所获得的结果证明了与其他拟议算法相比,二协方差PLDA模型在性能结果方面的优越性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号