Incremental Transfer Learning in Two-pass Information Bottleneck Based Speaker Diarization System for Meetings

Abstract

The two-pass information bottleneck (TPIB) based speaker diarization system operates independently on different conversational recordings. The TPIB system does not consider previously learned speaker-discriminative information while diarizing new conversations. Hence, the real time factor (RTF) of the TPIB system is high owing to the training time required for the artificial neural network (ANN). This paper attempts to improve the RTF of the TPIB system using an incremental transfer learning approach in which the parameters learned by the ANN from other conversations are updated using the current conversation rather than learned from scratch. This reduces the RTF significantly. The effectiveness of the proposed approach compared to the baseline IB and the TPIB systems is demonstrated on standard NIST and AMI conversational meeting datasets. With a minor degradation in performance, the proposed system shows a significant improvement of 33.07% and 24.45% in RTF with respect to the TPIB system on the NIST RT-04Eval and AMI-1 datasets, respectively.
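
The idea in the abstract, reusing previously learned parameters so that less training is needed per recording, can be illustrated with a toy sketch. This is not the authors' system: the one-parameter model `y = w * x` stands in for the ANN, the data, learning rate, and tolerance are hypothetical, and the helper names (`real_time_factor`, `relative_improvement_percent`, `train`) are introduced here only for illustration. The sketch assumes the usual definition of RTF as processing time divided by audio duration, and uses the number of gradient steps as a rough proxy for training time.

```python
def real_time_factor(processing_seconds, audio_seconds):
    """RTF = time spent processing / duration of the audio processed."""
    return processing_seconds / audio_seconds


def relative_improvement_percent(baseline, proposed):
    """Relative reduction of `proposed` with respect to `baseline`, in percent."""
    return 100.0 * (baseline - proposed) / baseline


def train(xs, ys, w_init, lr=0.05, tol=1e-6, max_steps=10_000):
    """Gradient descent on mean squared error for the toy model y = w * x.

    Returns the fitted weight and the number of update steps taken.
    """
    w = w_init
    for step in range(max_steps):
        grad = sum(2.0 * (w * x - y) * x for x, y in zip(xs, ys)) / len(xs)
        if abs(grad) < tol:
            return w, step
        w -= lr * grad
    return w, max_steps


# Hypothetical data: two "conversations" whose targets differ slightly.
xs = [1.0, 2.0, 3.0]
ys_previous = [2.0 * x for x in xs]
ys_current = [2.1 * x for x in xs]

# Parameters learned on the previous conversation.
w_prev, _ = train(xs, ys_previous, w_init=0.0)

# Current conversation: train from scratch vs. start from w_prev.
w_scratch, steps_scratch = train(xs, ys_current, w_init=0.0)
w_warm, steps_warm = train(xs, ys_current, w_init=w_prev)

print("steps from scratch:", steps_scratch)
print("steps with warm start:", steps_warm)
print("relative reduction in steps (%):",
      relative_improvement_percent(steps_scratch, steps_warm))
```

Both runs reach the same weight, but the warm-started run begins closer to it and stops after fewer updates. In this toy setting the saving is small because the model has a single parameter; the figures reported in the abstract come from the authors' experiments and are not reproduced by this sketch.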

Bibliographic details

Source
IEEE International Conference on Acoustics, Speech and Signal Processing | 2019 | pp. 5996-6664 | 5 pages
Authors
Nauman Dawalatabad; Srikanth Madikeri; C Chandra Sekhar; Hema A Murthy
Original format: PDF
Chinese Library Classification: TN912-53
Keywords
Speaker diarization; transfer learning; information bottleneck

Similar literature

1. Active Learning Based Constrained Clustering For Speaker Diarization [J]. Chengzhu Yu, John H. L. Hansen. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2017, No. 11
2. A new architecture based VAD for speaker diarization/detection systems [J]. Ouassila Kenai, Siham Ouamour, Mhania Guerti. International Journal of Speech Technology. 2019, No. 3
3. Initialization of Iterative-Based Speaker Diarization Systems for Telephone Conversations [J]. Ben-Harush O., Ben-Harush O., Lapidot I. IEEE Transactions on Audio, Speech, and Language Processing. 2012, No. 2
4. Incremental Transfer Learning in Two-pass Information Bottleneck Based Speaker Diarization System for Meetings [C]. Nauman Dawalatabad, Srikanth Madikeri, C Chandra Sekhar. IEEE International Conference on Acoustics, Speech and Signal Processing. 2019
5. Use of speaker location features in meeting diarization [D]. Otterson, Scott. 2008
6. Performance of a Deep Neural Network Algorithm Based on a Small Medical Image Dataset: Incremental Impact of 3D-to-2D Reformation Combined with Novel Data Augmentation Photometric Conversion or Transfer Learning [O]. Vikash Gupta, Mutlu Demirer, Matthew Bigelow. 2020
7. Incremental Transfer Learning in Two-pass Information Bottleneck Based Speaker Diarization System for Meetings [O]. Nauman Dawalatabad, Srikanth Madikeri, C Chandra Sekhar. 2019
