首页> 外文会议>International Conference on speech and computer >Neural Network Speaker Descriptor in Speaker Diarization of Telephone Speech
【24h】

Neural Network Speaker Descriptor in Speaker Diarization of Telephone Speech

机译:电话语音说话人差异化中的神经网络说话人描述符

获取原文

摘要

In this paper, we have been investigating an approach to a speaker representation for a diarization system that clusters short telephone conversation segments (produced by the same speaker). The proposed approach applies a neural-network-based descriptor that replaces a usual i-vector descriptor in the state-of-the-art diarization systems. The comparison of these two techniques was done on the English part of the CallHome corpus. The final results indicate the superiority of the i-vector's approach although our proposed descriptor brings an additive information. Thus, the combined descriptor represents a speaker in a segment for diarization purpose with lower diarization error (almost 20% relative improvement compared with only i-vector application).
机译:在本文中,我们一直在研究一种将简短的电话会话段(由同一说话人产生的声音)聚类的差异化系统中说话人表示的方法。所提出的方法应用了基于神经网络的描述符,该描述符替代了最新的数字化系统中常用的i-vector描述符。这两种技术的比较是在CallHome语料库的英语部分进行的。尽管我们提出的描述符带来了附加信息,但最终结果表明了i-vector方法的优越性。因此,组合的描述符表示用于分割目的的片段中的说话者,并且具有较低的分割误差(与仅i-vector应用相比,相对改善了将近20%)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号