I-vectors and ILP clustering adapted to cross-show speaker diarization

机译：I矢量和ILP聚类适用于跨场演说者差异化

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose to study speaker diarization from a collection of audio documents. The goal is to detect speakers appearing in several shows. In our approach, each show of the collection is processed separately before being processed collectively, to group speakers involved in several shows. Two clustering methods are studied for the overall processing of the collection: one uses the NCLR metric and the other is inspired by techniques based on i-vectors, mainly used in the speaker verification field. Both methods were evaluated on the whole training corpus of ESTER 2. The method based on the use of i-vectors achieves error rates similar to those obtained by the NCLR method, however, the computation time is on average 8.66 times faster. Therefore, this method is suitable for processing large volumes of data.

机译：我们建议从音频文档的集合中研究说话者的歧义化。目的是检测出现在几场演出中的说话者。在我们的方法中，收藏集的每个节目在进行集体处理之前都将被分别处理，以将参与多个节目的演讲者分组。研究了两种用于集合总体处理的聚类方法：一种使用NCLR度量，另一种则受到基于i矢量的技术的启发，主要用于说话人验证领域。两种方法都在ESTER 2的整个训练语料库上进行了评估。基于i向量的方法的错误率与NCLR方法相似，但是计算时间平均快了8.66倍。因此，此方法适用于处理大量数据。

著录项

来源
《Annual conference of the International Speech Communication Association》|2012年|2171-2174|共4页
会议地点
作者
Gregor Dupuy; Michael Rouvier; Sylvain Meignier; Yannick Esteve;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
speaker diarization; cross-show diarization; i-vectors; ilp clustering;

机译：说话人差异化跨场展示差异化; i向量ilp群集;
入库时间 2022-08-26 15:11:04

相似文献

外文文献
中文文献
专利

1. Improved i-Vector Representation for Speaker Diarization [J] . Xu Yan, McLoughlin Ian, Song Yan, Circuits, systems, and signal processing . 2016,第9期

机译：改进的i-Vector表示以实现说话人区分
2. Hybridization DE with K-means for speaker clustering in speaker diarization of broadcasts news [J] . Dabbabi Karim, Hajji Salah, Cherif Adnen International journal of speech technology . 2019,第4期

机译：与K-means的混合DE用于演讲者广播新闻的演讲者聚类
3. Adaptive speaker diarization of broadcast news based on factor analysis [J] . Desplanques Brecht, Demuynck Kris, Martens Jean Pierre Computer speech and language . 2017,第nova期

机译：基于因子分析的广播新闻自适应说话人二元化
4. I-vectors and ILP clustering adapted to cross-show speaker diarization [C] . Grégor Dupuy, Mickael Rouvier, Sylvain Meignier, INTERSPEECH 2012 . 2012

机译：i-vectors和ILP聚类适用于跨展示扬声器日益改估
5. Automatic Speaker Recognition and Diarization in Co-Channel Speech [D] . Shokouhi, Navid. 2017

机译：同频道语音中的说话人自动识别和区分
6. Supervised Speaker Diarization Using Random Forests: A Tool for Psychotherapy Process Research [O] . Lukas Fürer, Nathalie Schenk, Volker Roth, 2020

机译：使用随机森林监督扬声器日期：一种心理治疗过程研究的工具
7. Improved i-Vector Representation for Speaker Diarization [O] . Yan Xu, Ian McLoughlin, Yan Song, 2015

机译：改进的i-Vector表示以实现说话人区分
8. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment. [R] . Hansen, J. H. 2015

机译：强大的语音处理和识别：说话者ID，语言ID，语音识别/关键字识别，Diarization / Co-Channel /环境表征，说话者状态评估。

I-vectors and ILP clustering adapted to cross-show speaker diarization

摘要

著录项

相似文献

相关主题

期刊订阅