首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Incremental Transfer Learning in Two-pass Information Bottleneck Based Speaker Diarization System for Meetings
【24h】

Incremental Transfer Learning in Two-pass Information Bottleneck Based Speaker Diarization System for Meetings

机译:基于两键信息瓶颈的扬声器日复速度系统的增量转移学习

获取原文
获取外文期刊封面目录资料

摘要

The two-pass information bottleneck (TPIB) based speaker diarization system operates independently on different conversational recordings. TPIB system does not consider previously learned speaker discriminative information while di-arizing new conversations. Hence, the real time factor (RTF) of TPIB system is high owing to the training time required for the artificial neural network (ANN). This paper attempts to improve the RTF of the TPIB system using an incremental transfer learning approach where the parameters learned by the ANN from other conversations are updated using current conversation rather than learning parameters from scratch. This reduces the RTF significantly. The effectiveness of the proposed approach compared to the baseline IB and the TPIB systems is demonstrated on standard NIST and AMI conversational meeting datasets. With a minor degradation in performance, the proposed system shows a significant improvement of 33.07% and 24.45% in RTF with respect to TPIB system on the NIST RT-04Eval and AMI-1 datasets, respectively.
机译:双通信息瓶颈(TPIB)的扬声器深度化系统在不同的会话记录上独立运行。 TPIB系统不考虑以前学识到的扬声器歧视信息,同时引发了新的对话。因此,由于人工神经网络(ANN)所需的训练时间,TPIB系统的实时因素(RTF)很高。本文试图使用增量转移学习方法改进TPIB系统的RTF,其中来自其他对话中的ANN从其他对话中学到的参数使用当前对话而不是从头开始进行更新。这显着降低了RTF。与基线IB和TPIB系统相比,所提出的方法的有效性在标准NIST和AMI对话会议数据集上展示。随着性能微小的降解,所提出的系统分别在RTF中分别在NIST RT-04EVAL和AMI-1数据集上的TPIB系统中显着提高33.07%和24.45%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号