UBM Based Speaker Segmentation and Clustering for 2-Speaker Detection

Abstract

This paper proposes a speaker segmentation method based on the log-likelihood ratio score (LLRS) over a universal background model (UBM) and a speaker clustering method based on the difference of log-likelihood scores between two speaker models. During segmentation, the LLRS between two adjacent speech segments over the UBM is used as a distance measure; during clustering, the difference of log-likelihood scores between the two speaker models is used as the speaker classification criterion. A complete system for the NIST 2002 2-speaker task is built from these methods. Experimental results on the NIST 2002 Switchboard Cellular speaker segmentation corpus, the 1-speaker evaluation corpus, and the 2-speaker evaluation corpus show the potential of the proposed algorithms.
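
The abstract does not give formulas, so the following is only an illustrative sketch of the two scores it names, not the authors' implementation. It assumes diagonal-covariance GMMs, mean-only MAP adaptation of the UBM to a segment, and per-frame averaged log-likelihoods; the function names, the relevance factor of 16, and the zero decision threshold are all hypothetical choices made for this example.

```python
import numpy as np

def _log_components(X, weights, means, variances):
    """Per-frame, per-component log(w_m * N(x | mu_m, diag(var_m))), shape (T, M)."""
    X = np.atleast_2d(X)
    diff = X[:, None, :] - means[None, :, :]                        # (T, M, D)
    log_norm = -0.5 * np.sum(np.log(2.0 * np.pi * variances), axis=1)  # (M,)
    mahal = np.sum(diff ** 2 / variances[None, :, :], axis=2)       # (T, M)
    return np.log(weights)[None, :] + log_norm[None, :] - 0.5 * mahal

def gmm_avg_loglik(X, weights, means, variances):
    """Average per-frame log-likelihood of frames X under a diagonal GMM."""
    lc = _log_components(X, weights, means, variances)
    peak = lc.max(axis=1, keepdims=True)                            # log-sum-exp trick
    frame_ll = peak[:, 0] + np.log(np.exp(lc - peak).sum(axis=1))
    return float(frame_ll.mean())

def map_adapt_means(X, weights, means, variances, relevance=16.0):
    """Mean-only MAP adaptation of the UBM to frames X (an assumption of this sketch)."""
    X = np.atleast_2d(X)
    lc = _log_components(X, weights, means, variances)
    post = np.exp(lc - lc.max(axis=1, keepdims=True))
    post /= post.sum(axis=1, keepdims=True)                         # responsibilities (T, M)
    n = post.sum(axis=0)                                            # soft counts (M,)
    ex = (post.T @ X) / np.maximum(n, 1e-10)[:, None]               # per-component data means
    alpha = (n / (n + relevance))[:, None]
    return alpha * ex + (1.0 - alpha) * means

def llrs(seg_a, seg_b, ubm):
    """Assumed LLRS: score seg_b against a model adapted from seg_a, relative to the UBM.

    Higher values suggest the two adjacent segments come from the same speaker.
    """
    w, mu, var = ubm
    mu_a = map_adapt_means(seg_a, w, mu, var)
    return gmm_avg_loglik(seg_b, w, mu_a, var) - gmm_avg_loglik(seg_b, w, mu, var)

def assign_speaker(segment, model_1, model_2):
    """Label a segment by the sign of the log-likelihood difference between two speaker models."""
    diff = gmm_avg_loglik(segment, *model_1) - gmm_avg_loglik(segment, *model_2)
    return (1 if diff >= 0 else 2), diff
```

In this sketch, a speaker change between two adjacent segments would be hypothesized when `llrs` falls below some tuned threshold, and each resulting segment would then be assigned to whichever of the two speaker models scores it higher. How the actual system sets thresholds or trains its models is not stated in the abstract.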

Bibliographic details

Source
Chinese Spoken Language Processing; Lecture Notes in Artificial Intelligence; 4274 | 2006 | pp. 116-125 | 10 pages
Conference location: Singapore (SG)
Authors
Jing Deng; Thomas Fang Zheng; Wenhu Wu;
Author affiliation

Center for Speech Technology, Tsinghua National Laboratory for Information Science and Technology, Tsinghua University, Beijing, 100084;

Original format: PDF
Text language: English
Chinese Library Classification: Programming languages, algorithmic languages
Keywords
speaker segmentation; speaker clustering; multi-speaker; speaker detection;
