首页> 外文会议>Chinese Spoken Language Processing; Lecture Notes in Artificial Intelligence; 4274 >UBM Based Speaker Segmentation and Clustering for 2-Speaker Detection
【24h】

UBM Based Speaker Segmentation and Clustering for 2-Speaker Detection

机译:基于UBM的说话人分割和2说话人聚类检测

获取原文
获取原文并翻译 | 示例

摘要

In this paper, a speaker segmentation method based on log-likelihood ratio score (LLRS) over universal background model (UBM) and a speaker clustering method based on difference of log-likelihood scores between two speaker models are proposed. During the segmentation process, the LLRS between two adjacent speech segments over UBM is used as a distance measure, while during the clustering process, the difference of log-likelihood scores between two speaker models is used as a speaker classification criterion. A complete system for NIST 2002 2-speaker task is presented using the methods mentioned above. Experimental results on NIST 2002 Switchboard Cellular speaker segmentation corpus, 1-speaker evaluation corpus and 2-speaker evaluation corpus show the potentiality of the proposed algorithms.
机译:提出了一种基于对数似然比得分(LLRS)的通用背景模型(UBM)说话人分割方法和一种基于对数似然得分差异的说话人聚类方法。在分割过程中,UBM上两个相邻语音片段之间的LLRS被用作距离度量,而在聚类过程中,两个说话者模型之间的对数似然分数的差异被用作说话者分类标准。使用上述方法介绍了用于NIST 2002 2扬声器任务的完整系统。在NIST 2002总机蜂窝电话扬声器分割语料库,1-扬声器评估语料库和2-扬声器评估语料库上的实验结果表明了该算法的潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号