【24h】

A Simple But Effective Approach to Speaker Tracking in Broadcast News

机译:一种简单但有效的广播新闻发言人跟踪方法

获取原文
获取原文并翻译 | 示例

摘要

The automatic transcription of broadcast news and meetings involves the segmentation, identification and tracking of speaker turns during each session, which is known as speaker diarization. This paper presents a simple but effective approach to a slightly different task, called speaker tracking, also involving audio segmentation and speaker identification, but with a subset of known speakers, which allows to estimate speaker models and to perform identification on a segment-by-segment basis. The proposed algorithm segments the audio signal in a fully unsu-pervised way, by locating the most likely change points from an purely acoustic point of view. Then the available speaker data are used to estimate single-Gaussian acoustic models. Finally, speaker models are used to classify the audio segments by choosing the most likely speaker or, alternatively, the Other category, if none of the speakers is likely enough. Despite its simplicity, the proposed approach yielded the best performance in the speaker tracking challenge organized in November 2006 by the Spanish Network on Speech Technology.
机译:广播新闻和会议的自动转录涉及在每个会话期间对讲话者轮番进行分段,识别和跟踪,这被称为讲话者二值化。本文提出了一种简单而有效的方法来处理稍有不同的任务,称为说话人跟踪,它还涉及音频分割和说话人识别,但是还包含一个已知说话人的子集,该子集允许估计说话人模型并逐段执行识别。细分基础。通过从纯声学的角度定位最可能的变化点,提出的算法以完全不受监督的方式分割音频信号。然后,可用的说话者数据将用于估计单高斯声学模型。最后,如果没有一个说话人可能足够,则使用说话人模型通过选择最可能的说话人或其他类别来对音频片段进行分类。尽管方法简单,但在西班牙语音技术网络于2006年11月组织的演讲者跟踪挑战赛中,该方法仍取得了最佳性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号