首页> 外文会议>INTERSPEECH 2012 >I-vectors and ILP clustering adapted to cross-show speaker diarization

【24h】

I-vectors and ILP clustering adapted to cross-show speaker diarization

机译：i-vectors和ILP聚类适用于跨展示扬声器日益改估

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose to study speaker diarization from a collection of audio documents. The goal is to detect speakers appearing in several shows. In our approach, each show of the collection is processed separately before being processed collectively, to group speakers involved in several shows. Two clustering methods are studied for the overall processing of the collection: one uses the NCLR metric and the other is inspired by techniques based on i-vectors, mainly used in the speaker verification field. Both methods were evaluated on the whole training corpus of ESTER 2. The method based on,the use of i-vectors achieves error rates similar to those obtained by the NCLR method, however, the computation time is on average 8.66 times faster. Therefore, this method is suitable for processing large volumes of data.

机译：我们建议从一系列音频文件学习扬声器日益改血。目标是检测几个节目中出现的扬声器。在我们的方法中，在共同处理之前，每个集合的每个节目都是单独处理的，对涉及几个节目的小组扬声器。研究了两个聚类方法，用于集合的整体处理：一个使用NCLR度量，另一个由基于I-Viptors的技术启发，主要用于扬声器验证领域。在酯类的整个训练语料库中评估了两种方法。该方法基于，使用I-vOcs的使用与NCLR方法获得的误差率类似，但是，计算时间平均速度更快8.66倍。因此，该方法适用于处理大量数据。

著录项

来源
《INTERSPEECH 2012》|2012年||共4页
会议地点
作者
Grégor Dupuy; Mickael Rouvier; Sylvain Meignier; Yannick Estève;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 73.4136083;
关键词
speaker diarization; cross-show diarization; i-vectors; ilp clustering;

机译：扬声器日益衰退;横向展现;i-vectors;ILP聚类;

相似文献

外文文献
专利

1. Improved i-Vector Representation for Speaker Diarization [J] . Xu Yan, McLoughlin Ian, Song Yan, Circuits, systems, and signal processing . 2016,第9期

机译：改进的i-Vector表示以实现说话人区分
2. Hybridization DE with K-means for speaker clustering in speaker diarization of broadcasts news [J] . Dabbabi Karim, Hajji Salah, Cherif Adnen International journal of speech technology . 2019,第4期

机译：与K-means的混合DE用于演讲者广播新闻的演讲者聚类
3. Adaptive speaker diarization of broadcast news based on factor analysis [J] . Desplanques Brecht, Demuynck Kris, Martens Jean Pierre Computer speech and language . 2017,第nova期

机译：基于因子分析的广播新闻自适应说话人二元化
4. I-vectors and ILP clustering adapted to cross-show speaker diarization [C] . Gregor Dupuy, Michael Rouvier, Sylvain Meignier, Annual conference of the International Speech Communication Association . 2012

机译：I矢量和ILP聚类适用于跨场演说者差异化
5. Automatic Speaker Recognition and Diarization in Co-Channel Speech [D] . Shokouhi, Navid. 2017

机译：同频道语音中的说话人自动识别和区分
6. Supervised Speaker Diarization Using Random Forests: A Tool for Psychotherapy Process Research [O] . Lukas Fürer, Nathalie Schenk, Volker Roth, 2020

机译：使用随机森林监督扬声器日期：一种心理治疗过程研究的工具
7. Improved i-Vector Representation for Speaker Diarization [O] . Yan Xu, Ian McLoughlin, Yan Song, 2015

机译：改进的i-Vector表示以实现说话人区分
8. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment. [R] . Hansen, J. H. 2015

机译：强大的语音处理和识别：说话者ID，语言ID，语音识别/关键字识别，Diarization / Co-Channel /环境表征，说话者状态评估。

I-vectors and ILP clustering adapted to cross-show speaker diarization

摘要

著录项

相似文献

相关主题

期刊订阅