【24h】

GMM-UBM based open-set online speaker diarization

机译:基于GMM-UBM的开放式在线扬声器二元化

获取原文

摘要

In this paper, we present an open-set online speaker diarization system. The system is based on Gaussian mixture models (GMMs), which are used as speaker models. The system starts with just 3 such models (one each for both genders and one for non-speech) and creates models for individual speakers not till the speakers occur. As more and more speakers appear, more models are created. Our system implicitly performs audio segmentation, speechon-speech classification, gender recognition and speaker identification. The system is tested with the HUB4-1996 radio broadcast news database.
机译:在本文中,我们提出了一种开放式的在线说话者二值化系统。该系统基于用作说话者模型的高斯混合模型(GMM)。系统仅以3种这样的模型开始(每种模型分别用于性别和非语音),并为单个讲话者创建模型,直到出现讲话者为止。随着越来越多的扬声器出现,将创建更多模型。我们的系统隐式执行音频分割,语音/非语音分类,性别识别和说话人识别。该系统已通过HUB4-1996广播新闻数据库进行了测试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号