Unfolding speaker clustering potential

Abstract

Speaker clustering is the task of grouping a set of speech utterances into speaker-specific classes. The basic techniques for solving this task are similar to those used for speaker verification and identification. The hypothesis of this paper is that the techniques originally developed for speaker verification and identification are not sufficiently discriminative for speaker clustering. The processing chain for speaker clustering is long, however, so there are many potential areas for improvement. The question is where improvements should be made to have the greatest effect on the final result. To answer this question, this paper takes a biomimetic approach based on a study in which human participants act as an automatic speaker clustering system. Our findings are twofold: the modeling stage has the highest potential, and information about the temporal succession of frames is crucially missing. Experimental results from our implementation of a speaker clustering system that incorporates these findings, applied to TIMIT data, show the validity of our approach.
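
The abstract does not say how the missing temporal information is modeled, so the following is only an illustrative sketch, not the authors' method. It assumes a toy setting: a synthetic smooth trajectory stands in for a sequence of feature vectors, and a single diagonal Gaussian stands in for a frame-based speaker model. The helper names (`diag_gaussian`, `deltas`, `max_abs_diff`) are hypothetical. The point it tries to show is that a model fitted to individual frames is unchanged when the frames are shuffled, whereas a model fitted to frame-to-frame differences is not.

```python
import math
import random
from statistics import fmean, pvariance


def diag_gaussian(frames):
    """Per-dimension (mean, variance): a single-component, diagonal-covariance model."""
    return [(fmean(dim), pvariance(dim)) for dim in zip(*frames)]


def deltas(frames):
    """First-order differences between consecutive frames (a simple form of temporal context)."""
    return [[c - p for p, c in zip(prev, cur)] for prev, cur in zip(frames, frames[1:])]


def max_abs_diff(model_a, model_b):
    """Largest absolute difference between corresponding model parameters."""
    return max(abs(x - y) for a, b in zip(model_a, model_b) for x, y in zip(a, b))


# Synthetic smooth trajectory (assumption: a stand-in for real feature vectors).
frames = [[math.sin(0.1 * t), math.cos(0.07 * t)] for t in range(300)]

# Same frames, but with their temporal order destroyed.
shuffled = frames[:]
random.Random(0).shuffle(shuffled)

# Models of the frames themselves cannot tell the two sequences apart ...
static_gap = max_abs_diff(diag_gaussian(frames), diag_gaussian(shuffled))

# ... while models of frame-to-frame differences can.
delta_gap = max_abs_diff(diag_gaussian(deltas(frames)), diag_gaussian(deltas(shuffled)))

print(f"static model gap: {static_gap:.2e}")
print(f"delta model gap:  {delta_gap:.2e}")
```

In this toy setting the static gap is essentially zero because means and variances do not depend on frame order, while the delta gap is large because shuffling replaces small steps along a smooth path with jumps between unrelated frames.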

Bibliographic details

Source: ACM international conference on Multimedia, 2009, 10 pages
Authors: Thilo Stadelmann; Bernd Freisleben
Original format: PDF
Chinese Library Classification: Multimedia technology and multimedia computers
Keywords: GMM; MFCC; one-class SVM; speaker clustering; speaker diarization; speaker identification; temporal context
