IEEE Convention of Electrical and Electronics Engineers in Israel

Mahalanobis based emission model for speaker diarization of telephone conversations



Abstract

The primary objective of any speaker diarization system is to assign each speech segment to one of the K speakers in the conversation. In this work we focus on telephone conversations, where the number of speakers is known and equals 2. We use a hidden-distortion-model (HDM)-based system. The HDM allows different emission models to be used as speaker models. Choosing adequate emission models that properly represent the characteristics of the data is important for the system's performance. We investigate the effect of several codebook (CB)-based emission models, with Euclidean and Mahalanobis distances. The Mahalanobis distance was chosen for its potential to represent the spatial layout of the data better, while constraints were imposed to keep the model from diverging. The influence of the different methods is evaluated using 108 telephone conversations taken from the LDC CallHome corpus. All the experiments achieved results poorer than the original SOM-based system (diarization error rate, DER = 12.70%).
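To make the contrast between the two distances concrete, here is a minimal sketch of a codebook distortion computed with either measure. It is not the authors' implementation: the diagonal covariance, the variance floor `VAR_FLOOR` (a stand-in for the unspecified constraints that keep the model from diverging), the function names, and the toy data are all assumptions made for illustration.

```python
VAR_FLOOR = 1e-3  # hypothetical floor; the abstract does not state the actual constraint


def euclidean_sq(x, c):
    """Squared Euclidean distance between feature vector x and codeword c."""
    return sum((xi - ci) ** 2 for xi, ci in zip(x, c))


def mahalanobis_sq_diag(x, c, var):
    """Squared Mahalanobis distance under an assumed diagonal covariance.

    Each variance is floored so that a collapsing codeword cannot make
    the distance grow without bound.
    """
    return sum((xi - ci) ** 2 / max(vi, VAR_FLOOR)
               for xi, ci, vi in zip(x, c, var))


def codebook_distortion(x, codebook, variances=None):
    """Distortion of x against a codebook: distance to the nearest codeword."""
    if variances is None:
        return min(euclidean_sq(x, c) for c in codebook)
    return min(mahalanobis_sq_diag(x, c, v)
               for c, v in zip(codebook, variances))


# Toy 2-D data: one codeword per speaker; speaker A is spread widely along axis 0.
cb_a, var_a = [(0.0, 0.0)], [(9.0, 0.25)]
cb_b, var_b = [(4.0, 0.0)], [(0.25, 0.25)]
frame = (2.5, 0.0)

# (distortion to speaker A, distortion to speaker B) under each measure
eu = (codebook_distortion(frame, cb_a), codebook_distortion(frame, cb_b))
ma = (codebook_distortion(frame, cb_a, var_a),
      codebook_distortion(frame, cb_b, var_b))
```

On this toy frame the Euclidean distortion is lower for speaker B, while the Mahalanobis distortion is lower for speaker A, because A's wide spread along the first axis makes the frame a plausible sample from A. This is the kind of sensitivity to the data's spatial layout that the abstract refers to.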
