Investigating Various Diarization Algorithms for Speaker in the Wild (SITW) Speaker Recognition Challenge

机译：在野生（SITW）扬声器识别挑战中调查扬声器的各种日复一衰算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Collecting training data for real-world text-independent speaker recognition is challenging. In practice, utterances for a specific speaker are often mixed with many other acoustic signals. To guarantee the recognition performance, the segments spoken by target speakers should be precisely picked out. An automatic detection could be developed to reduce the cost of expensive human hand-made annotations. One way to achieve this goal is by using speaker diarization as a pre-processing step in the speaker enrollment phase. To this end, three speaker diarization algorithms based on Bayesian information criterion (BIC), agglomerative information bottleneck (aIB) and i-vector are investigated in this paper. The corresponding impacts on the results of speaker recognition system are also studied. Experiments conducted on Speaker in the Wild (SITW) Speaker Recognition Challenge (SRC) 2016 showed that the utilization of a proper speaker diarization improves the overall performance. Some more efforts are made to combine these methods together as well.

机译：收集现实世界文本独立扬声器识别的培训数据是具有挑战性的。在实践中，特定扬声器的话语通常与许多其他声学信号混合。为了保证识别性能，应准确地挑选目标发言者所说的细分。可以开发自动检测以降低昂贵的人类手工制作注释的成本。实现这一目标的一种方法是通过使用扬声器日益改估作为扬声器注册阶段的预处理步骤。为此，本文研究了基于贝叶斯信息标准（BIC），附聚信息瓶颈（AIB）和I形载体的三个扬声器深度算法。还研究了对扬声器识别系统结果的相应影响。在野外（SITW）扬声器识别挑战（SRC）2016上对扬声器进行的实验表明，利用适当的扬声器深度提高了整体性能。还有一些努力也将这些方法组合在一起。

著录项

来源
《Annual Conference of the International Speech Communication Association》|2016年|p745-1531|共5页
会议地点
作者
Yi Liu; Yao Tian; Liang He; Jia Liu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TB95-53;
关键词
入库时间 2022-08-21 11:41:05

相似文献

外文文献
中文文献
专利

1. State-of-the-art speaker recognition with neural network embeddings in NIST SRE18 and Speakers in the Wild evaluations [J] . Jesus Villalba, Nanxin Chen, David Snyder, Computer speech and language . 2020,第Mara期

机译：NIST SRE18中具有神经网络嵌入功能的最先进的说话人识别功能，Wild评估中的说话人功能
2. Real-Time Implementation of Speaker Diarization System on Raspberry PI3 Using TLBO Clustering Algorithm [J] . Dabbabi Karim, Hajji Salah, Cherif Adnen Circuits, systems, and signal processing . 2020,第8期

机译：用TLBO聚类算法实时实施覆盆子PI3上的扬声器日复速度系统
3. Investigation of the effect of data duration and speaker gender on text-independent speaker recognition [J] . Cemal Hanilci, Figen Ertas Computers and Electrical Engineering . 2013,第2期

机译：研究数据持续时间和说话人性别对与文本无关的说话人识别的影响
4. Investigating Various Diarization Algorithms for Speaker in the Wild (SITW) Speaker Recognition Challenge [C] . Yi Liu, Yao Tian, Liang He, Annual Conference of the International Speech Communication Association . 2016

机译：调查野生（SITW）扬声器识别挑战中扬声器的各种日复日复速度算法
5. Automatic Speaker Recognition and Diarization in Co-Channel Speech [D] . Shokouhi, Navid. 2017

机译：同频道语音中的说话人自动识别和区分
6. Supervised Speaker Diarization Using Random Forests: A Tool for Psychotherapy Process Research [O] . Lukas Fürer, Nathalie Schenk, Volker Roth, 2020

机译：使用随机森林监督扬声器日期：一种心理治疗过程研究的工具
7. Speakers In The Wild (SITW): The QUT Speaker Recognition System [O] . Ghaemmaghami Houman, Rahman Md Hafizur, Himawan Ivan, 2016

机译：野外演说者（SITW）：QUT演说者识别系统
8. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment. [R] . Hansen, J. H. 2015

机译：强大的语音处理和识别：说话者ID，语言ID，语音识别/关键字识别，Diarization / Co-Channel /环境表征，说话者状态评估。

Investigating Various Diarization Algorithms for Speaker in the Wild (SITW) Speaker Recognition Challenge

摘要

著录项

相似文献

相关主题

期刊订阅