首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Transfer Learning Using Raw Waveform Sincnet for Robust Speaker Diarization
【24h】

Transfer Learning Using Raw Waveform Sincnet for Robust Speaker Diarization

机译:使用原始波形SCINNET转移学习,用于强大的扬声器日益改变

获取原文

摘要

Speaker diarization tells who spoke and when? in an audio stream. SincNet is a recently developed novel convolutional neural network (CNN) architecture where the first layer consists of parameterized sinc filters. Unlike conventional CNNs, SincNet take raw speech waveform as input. This paper leverages SincNet in vanilla transfer learning (VTL) setup. Out-domain data is used for training SincNet-VTL to perform frame-level speaker classification. Trained SincNet-VTL is later utilized as feature extractor for in-domain data. We investigated pooling (max, avg) strategies for deriving utterance-level embedding using frame-level features extracted from trained network. These utterance/segment level embedding are adopted as speaker models during clustering stage in diarization pipeline. We compared the proposed SincNet-VTL embedding with baseline i-vector features. We evaluated our approaches on two corpora, CRSS-PLTL and AMI. Results show the efficacy of trained SincNet-VTL for speaker-discriminative embedding even when trained on small amount of data. Proposed features achieved relative DER improvements of 19.12% and 52.07% for CRSS-PLTL and AMI data, respectively over baseline i-vectors.
机译:扬声器日益改善告诉谁说话和何时?在音频流中。 SINCNET是最近开发的新型卷积神经网络(CNN)架构,第一层由参数化SINC滤波器组成。与传统的CNN不同,SINCNET将原始语音波形作为输入。本文利用香草传输学习(VTL)设置的SINGNET。 OUT域数据用于培训SINCNET-VTL以执行帧级扬声器分类。培训的SINCNET-VTL稍后将用作域内数据的特征提取器。我们调查了使用从培训的网络中提取的帧级别嵌入的发出级别嵌入的汇集(MAX,AVG)策略。这些话语/段级别嵌入作为扬声器模型在日复日复速度管道中的聚类阶段。我们将建议的SincNet-VTL与基线I-Vector特征进行比较。我们评估了我们在两个Corcle,CRSS-PLTL和AMI上的方法。结果表明,即使在少量数据训练时,训练有素的SincNet-VTL用于扬声器歧视性嵌入的功效。在基线I载体上,所提出的特征在基线I - 载体上,CRSS-PLTL和AMI数据的相对改善19.12%和52.07%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号