首页> 外文OA文献 >DNN approach to speaker diarisation using speaker channels
【2h】

DNN approach to speaker diarisation using speaker channels

机译:DNN使用扬声器通道进行扬声器扩音的方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Speaker diarisation addresses the question of 'who speaks when' in audio recordings, and has been studied extensively in the context of tasks such as broadcast news, meetings, etc. Performing diarisation on individual headset microphone (IHM) channels is sometimes assumed to easily give the desired output of speaker labelled segments with timing information. However, it is shown that given imperfect data, such as speaker channels with heavy crosstalk and overlapping speech, this is not the case. Deep neural networks (DNNs) can be trained on features derived from the concatenation of speaker channel features to detect which is the correct channel for each frame. Crosstalk features can be calculated and DNNs trained with or without overlapping speech to combat problematic data. A simple frame decision metric of counting occurrences is investigated as well as adding a bias against selecting nonspeech for a frame. Finally, two different scoring setups are applied to both datasets. The stricter SHEF setup finds diarisation error rates (DER) of 9.2% on TBL and 23.2% on RT07 while the NIST setup achieves 5.7% and 15.1% respectively.
机译:演讲者差异化解决了录音中“谁说话的时间”的问题,并且在广播新闻,会议等任务的背景下进行了广泛研究。有时认为在单个头戴式麦克风(IHM)频道上进行差异化会很容易带有定时信息的扬声器标记段的期望输出。但是,事实表明,如果给定的数据不完整,例如串扰严重且语音重叠的扬声器通道,情况并非如此。深度神经网络(DNN)可以在从扬声器通道特征的级联中得出的特征上进行训练,以检测哪个是每个帧的正确通道。可以计算出串扰特征,并在有或没有重叠语音的情况下训练DNN,以解决问题数据。研究了计数发生次数的简单帧决策度量,以及增加了对选择帧的不发声的偏见。最后,将两种不同的评分设置应用于两个数据集。更严格的SHEF设置在TBL上的分辨错误率(DER)为9.2%,在RT07上为23.2%,而NIST设置分别达到5.7%和15.1%。

著录项

  • 作者

    Milner R.; Hain T.;

  • 作者单位
  • 年度 2017
  • 总页数
  • 原文格式 PDF
  • 正文语种 en
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号