GMM-UBM based open-set online speaker diarization

机译：基于GMM-UBM的开放式在线扬声器二元化

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we present an open-set online speaker diarization system. The system is based on Gaussian mixture models (GMMs), which are used as speaker models. The system starts with just 3 such models (one each for both genders and one for non-speech) and creates models for individual speakers not till the speakers occur. As more and more speakers appear, more models are created. Our system implicitly performs audio segmentation, speechon-speech classification, gender recognition and speaker identification. The system is tested with the HUB4-1996 radio broadcast news database.

机译：在本文中，我们提出了一种开放式的在线说话者二值化系统。该系统基于用作说话者模型的高斯混合模型（GMM）。系统仅以3种这样的模型开始（每种模型分别用于性别和非语音），并为单个讲话者创建模型，直到出现讲话者为止。随着越来越多的扬声器出现，将创建更多模型。我们的系统隐式执行音频分割，语音/非语音分类，性别识别和说话人识别。该系统已通过HUB4-1996广播新闻数据库进行了测试。

著录项

来源
《Annual conference of the International Speech Communication Association;INTERSPEECH 2010》|2011年|p.2330-2333|共4页
会议地点
作者
Juergen Geiger; Frank Wallhoff; Gerhard Rigoll;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信;
关键词
speaker diarization; gaussian mixture models; open-set speaker recognition;

机译：说话人差异化高斯混合模型;开放式说话人识别;

相似文献

外文文献
中文文献
专利

1. ONLINE SPEAKER DIARIZATION FOR MULTIMEDIA DATA RETRIEVAL ON MOBILE DEVICES [J] . KYUNG-MI PARK, JEONG-SIK PARK, JAE-HYUN BAE, International Journal of Pattern Recognition and Artificial Intelligence . 2012,第8期

机译：移动设备上多媒体数据的在线说话人数字化检索
2. Gammachirp Filter Banks Applied in Roust Speaker Recognition Based on GMM-UBM Classifier [J] . Deng Lei, Gao Yong The international arab journal of information technology . 2020,第2期

机译：基于GMM-UBM分类器的ROUST扬声器识别伽马基杂交滤波器银行
3. GMM-UBM Based Speaker Verification in Multilingual Environments [J] . Kshirod Sarmah, Utpal Bhattacharjee International Journal of Computer Science Issues . 2012,第6期

机译：多语言环境中基于GMM-UBM的说话人验证
4. GMM-UBM based open-set online speaker diarization [C] . Juergen Geiger, Frank Wallhoff, Gerhard Rigoll Annual conference of the International Speech Communication Association . 2010

机译：基于GMM-UBM的开放式在线扬声器深度
5. Impact of asynchronous and text -based communication modalities on non-native speakers of English in fully online U.S. university courses [D] . Parker, Mark L. 2008

机译：基于和文本的沟通方式对完全在线美国大学课程的非母语英语通信方式的影响
6. Supervised Speaker Diarization Using Random Forests: A Tool for Psychotherapy Process Research [O] . Lukas Fürer, Nathalie Schenk, Volker Roth, 2020

机译：使用随机森林监督扬声器日期：一种心理治疗过程研究的工具
7. End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors [O] . Shota Horiguchi, Yusuke Fujita, Shinji Watanabe, 2020

机译：用于基于编码器解码器的扬声器数量未知数量的扬声器的端到端扬声器深度
8. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment. [R] . Hansen, J. H. 2015

机译：强大的语音处理和识别：说话者ID，语言ID，语音识别/关键字识别，Diarization / Co-Channel /环境表征，说话者状态评估。

GMM-UBM based open-set online speaker diarization

摘要

著录项

相似文献

相关主题

期刊订阅