首页> 外文会议>International Symposium on Signal Processing and Information Technology >Robust Speaker Diarization in a Multi-Speaker Environment Using Autocorrelation-based Noise Subtraction
【24h】

Robust Speaker Diarization in a Multi-Speaker Environment Using Autocorrelation-based Noise Subtraction

机译:使用基于自相关的噪声减法的多扬声器环境中强大的扬声器日复速度

获取原文

摘要

This paper shows research performed into the topic of speaker diarization for multi-speaker environment. It looks into the algorithms and the implementation of an off-line speaker segmentation and indexing system for recorded speech data where usually more than one speaker is present. Speaker diarization is a well studied topic in the domain of broadcast news recordings. Most of the proposed systems involve hierarchical clustering of the data, where the number of speakers and their identities are known a priori. Speaker diarization is the task of assigning a unique label to all speech segments in an audio stream by the same speaker. There are two key challenges: processing speed and robustness in the presence of noise. In this paper we address the robustness issue by using a method already successful in speech recognition application. Using ANS (Autocorrelation-Based Noise Subtraction) for robust genetic algorithm-based speaker diarization, we compare the results with the baseline MFCC-based system in clean and noisy conditions.
机译:本文显示了对多扬声器环境的扬声器日益改估主题进行的研究。它调查算法和用于录制语音数据的离线扬声器分段和索引系统的实现,其中通常存在多于一个扬声器。扬声器日益改估是广播新闻录制领域的良好学习主题。大多数建议系统涉及数据的分层聚类,其中扬声器的数量及其身份已知先验。扬声器日流是通过同一扬声器分配给音频流中的所有语音段的唯一标签的任务。有两个关键挑战:在噪音存在下处理速度和稳健性。在本文中,我们通过使用在语音识别应用程序中已经成功的方法来解决鲁棒性问题。使用ANS(基于自相关的噪声减法)用于基于鲁棒的遗传算法的扬声器日期,我们将结果与基于基于基于基于基于基于基于基于基于基于基于MFCC的系统进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号