Extending the Task of Diarization to Speaker Attribution

机译：将差异化的任务扩展到说话者归因

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we extend the concept of speaker annotation within a single-recording, or speaker diarization, to a collection wide approach we call speaker attribution. Accordingly, speaker attribution is the task of clustering expectantly homogenous inter-session clusters obtained using diarization according to common cross-recording identities. The result of attribution is a collection of spoken audio across multiple recordings attributed to speaker identities. In this paper, an attribution system is proposed using mean-only MAP adaptation of a combined-gender UBM to model clusters from a perfect diarization system, as well as a JFA-based system with session variability compensation. The normalized cross-likelihood ratio is calculated for each pair of clusters to construct an attribution matrix and the complete linkage algorithm is employed to conduct clustering of the inter-session clusters. A matched cluster purity and coverage of 87.1% was obtained on the NIST 2008 SRE corpus.

机译：在本文中，我们将单次录音或说话者二值化内的说话者注释概念扩展到了我们称为说话者归因的广泛收集方法中。因此，说话者归因是根据通用的交叉记录身份对使用均匀化获得的预期同质的会话间聚类进行聚类的任务。归因的结果是归因于说话者身份的多个录音中的语音音频集合。在本文中，提出了一种归因系统，该归因系统使用了组合性别UBM的平均均值MAP自适应方法，以从理想的离散化系统以及具有会话可变性补偿的基于JFA的系统中对集群进行建模。为每对集群计算归一化的交叉似然比，以构造一个归因矩阵，并采用完整的链接算法对会话间集群进行聚类。在NIST 2008 SRE语料库中获得了匹配的簇纯度和87.1％的覆盖率。

著录项

来源
《Annual conference of the International Speech Communication Association;INTERSPEECH 2011》|2011年|p.1056-1059|共4页
会议地点
作者
Houman Ghaemmaghami; David Dean; Robbie Vogt; Sridha Sridharan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信;
关键词
speaker attribution; diarization; clustering; cross likelihood ratio; joint factor analysis;

机译：说话者归因;差异化集群交叉似然比;联合因素分析;

相似文献

外文文献
中文文献
专利

1. Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: Theory, implementation and analysis on standard tasks [J] . Federico Landini, Jan Profant, Mireia Diez, Computer speech and language . 2022,第Jana期

机译：扬声器日复速病中X-Vector序列（VBX）的Bayesian HMM聚类：理论，实施和标准任务的实施和分析
2. Hybridization DE with K-means for speaker clustering in speaker diarization of broadcasts news [J] . Dabbabi Karim, Hajji Salah, Cherif Adnen International journal of speech technology . 2019,第4期

机译：与K-means的混合DE用于演讲者广播新闻的演讲者聚类
3. Probabilistic Speaker Diarization With Bag-of-Words Representations of Speaker Angle Information [J] . Ishiguro K., Yamada T., Araki S., Audio, Speech, and Language Processing, IEEE Transactions on . 2012,第2期

机译：说话者角度信息的词袋表示概率的说话人区分
4. Incorporation of the ASR output in speaker segmentation and clustering within the task of speaker diarization of broadcast streams [C] . Silovsky Jan, Zdansky Jindrich, Nouza Jan, 2012 IEEE 14th International Workshop on Multimedia Signal Processing. . 2012

机译：在广播流的说话人二分之一化任务中，将ASR输出并入说话人分割和聚类中
5. Automatic Speaker Recognition and Diarization in Co-Channel Speech [D] . Shokouhi, Navid. 2017

机译：同频道语音中的说话人自动识别和区分
6. Supervised Speaker Diarization Using Random Forests: A Tool for Psychotherapy Process Research [O] . Lukas Fürer, Nathalie Schenk, Volker Roth, 2020

机译：使用随机森林监督扬声器日期：一种心理治疗过程研究的工具
7. Extending the task of diarization to speaker attribution [O] . Ghaemmaghami Houman, Dean David, Vogt Robbie, 2011

机译：将差异化的任务扩展到说话者归因
8. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment. [R] . Hansen, J. H. 2015

机译：强大的语音处理和识别：说话者ID，语言ID，语音识别/关键字识别，Diarization / Co-Channel /环境表征，说话者状态评估。

Extending the Task of Diarization to Speaker Attribution

摘要

著录项

相似文献

相关主题

期刊订阅