Comparing Multi-Stage Approaches for Cross-Show Speaker Diarization

Abstract

Acoustic speaker diarization is investigated for situations where a collection of shows from the same source needs to be processed. In this case, the same speaker should receive the same label across all shows. We compare different architectures for cross-show speaker diarization: the obvious concatenation of all shows, a hybrid system that applies a local clustering stage followed by a global clustering stage, and an incremental system that processes the shows in a predefined order and updates the speaker models accordingly. The incremental system is the best suited to real application scenarios. These three strategies were compared to a baseline single-show system on a set of 46 ten-minute samples of British English scientific podcasts.
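
The incremental strategy described in the abstract can be illustrated with a toy sketch. This is not the authors' system: it assumes each show has already been diarized locally, that every local cluster is summarized by a fixed-length vector (a hypothetical stand-in for a real speaker model), and that clusters are matched to global speakers by a cosine-distance threshold chosen arbitrarily here. The names incremental_cross_show and threshold are illustrative only.

```python
import math


def cosine_distance(a, b):
    """Return 1 - cosine similarity of two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def incremental_cross_show(shows, threshold=0.1):
    """Assign global speaker ids to local clusters, one show at a time.

    shows: list of dicts mapping a local cluster label to its vector
           (hypothetical input format, assumed for this sketch).
    Returns one dict per show mapping local label -> global speaker id.
    """
    models = []   # global speaker models, stored as (vector sum, count)
    results = []
    for local_clusters in shows:          # shows are processed in the given order
        mapping = {}
        for label, vec in local_clusters.items():
            best_id, best_dist = None, threshold
            for gid, (total, count) in enumerate(models):
                centroid = [t / count for t in total]
                dist = cosine_distance(vec, centroid)
                if dist < best_dist:
                    best_id, best_dist = gid, dist
            if best_id is None:
                # No existing model is close enough: create a new global speaker.
                models.append((list(vec), 1))
                best_id = len(models) - 1
            else:
                # Update the matched speaker model with the new cluster.
                total, count = models[best_id]
                models[best_id] = ([t + v for t, v in zip(total, vec)], count + 1)
            mapping[label] = best_id
        results.append(mapping)
    return results


shows = [
    {"A": [1.0, 0.0], "B": [0.0, 1.0]},
    {"X": [0.0, 0.9], "Y": [0.7, 0.7]},
]
print(incremental_cross_show(shows))
```

In this example, cluster X of the second show is matched to the speaker first seen as B, while Y opens a new global speaker. Because earlier shows are never revisited, the result depends on the processing order; the sketch also does not prevent two clusters of the same show from sharing a global id, which a real system would need to handle.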

Bibliographic record

Source
Annual Conference of the International Speech Communication Association (INTERSPEECH 2011), 2011, pp. 1060-1063 (4 pages)
Authors
Viet-Anh Tran; Viet Bac Le; Claude Barras; Lori Lamel
Original format: PDF
Chinese Library Classification: Communications
Keywords
speaker diarization; speaker segmentation and clustering; cross-show diarization

Similar literature

1. A Comparative Study of Bottom-Up and Top-Down Approaches to Speaker Diarization [J]. Evans N., Bozonnet S., Dong Wang. IEEE Transactions on Audio, Speech, and Language Processing. 2012, No. 2
2. Step-by-step and integrated approaches in broadcast news speaker diarization [J]. Sylvain Meignier, Daniel Moraru, Corinne Fredouille. Computer Speech and Language. 2006, No. 2-3
3. Hybridization DE with K-means for speaker clustering in speaker diarization of broadcasts news [J]. Dabbabi Karim, Hajji Salah, Cherif Adnen. International Journal of Speech Technology. 2019, No. 4
4. Investigation of speaker embeddings for cross-show speaker diarization [C]. Michael Rouvier, Benoit Favre. IEEE International Conference on Acoustics, Speech and Signal Processing. 2016
5. Automatic Speaker Recognition and Diarization in Co-Channel Speech [D]. Shokouhi, Navid. 2017
6. Supervised Speaker Diarization Using Random Forests: A Tool for Psychotherapy Process Research [O]. Lukas Fürer, Nathalie Schenk, Volker Roth. 2020
7. Multi-stage speaker diarization of broadcast news [O]. Barras, Claude, Zhu, Xuan, Meignier, Sylvain. 2006
