首页> 外文会议>European Conference on Speech Communication and Technology >A FAST, ACCURATE AND STREAM-BASED SPEAKER SEGMENTATION AND CLUSTERING ALGORITHM
【24h】

A FAST, ACCURATE AND STREAM-BASED SPEAKER SEGMENTATION AND CLUSTERING ALGORITHM

机译:一种快速,准确,基于流的扬声器分段和聚类算法

获取原文

摘要

In this paper a new pre-processor for a free speech transcription system is described. It performs a speech/non-speech partition, a segmentation of the speech parts into speaker turns, and a clustering of the speaker turns. It works in a stream-based mode, and it is aiming for a high accuracy with a low delay and processing time. Experiments on the Hub4 Broadcast News corpus show that the newly proposed pre-processor is competitive with and in some respects better than the best systems published so far. The paper also describes attempts to raise the system performance by supplementing the standard MFCC features with prosodic features such as pitch and voicing evidence.
机译:在本文中,描述了一种用于自由语音转录系统的新的预处理器。它执行语音/非语音分区,将语音部分分段为扬声器转动,以及扬声器转弯的聚类。它以基于流的模式工作,它的目标是具有低延迟和处理时间的高精度。 Hub4广播新闻语料库的实验表明,新提出的预处理器与迄今为止发布的最佳系统更竞争。本文还介绍了通过补充标准MFCC功能的韵律特征,例如音高和发声证据等标准MFCC功能来提高系统性能的尝试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号