International Conference on Intelligent Science and Big Data Engineering

Covariance Based Deep Feature for Text-Dependent Speaker Verification


Abstract

The d-vector approach has achieved impressive results in speaker verification. An utterance-level representation is obtained by averaging the frame-level outputs of a hidden layer of the DNN. Although this mean-based speaker identity representation performs well, it ignores the variability of frames across the whole utterance, which leads to information loss. This is particularly serious for text-dependent speaker verification, where within-utterance feature variability reflects text variability better than the mean does. To address this issue, a new covariance-based speaker representation is proposed in this paper: the covariance of the frame-level outputs is calculated and incorporated into the speaker identity representation. The proposed approach is investigated within a joint multi-task learning framework for text-dependent speaker verification. Experiments on RSR2015 and RedDots show that the covariance-based deep feature significantly improves performance compared to the traditional mean-based deep feature.
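The pooling contrast described above can be sketched as follows. This is a minimal illustrative reconstruction, not the authors' exact formulation: it assumes the frame-level hidden-layer outputs are available as a T×D matrix, pools them into a mean vector (the standard d-vector) and appends the upper triangle of the frame covariance matrix as the extra speaker-identity features.

```python
import numpy as np

def mean_cov_embedding(frame_outputs: np.ndarray) -> np.ndarray:
    """Pool a T x D matrix of frame-level DNN outputs into one utterance-level
    vector: the frame mean followed by the upper triangle of the frame
    covariance. (Illustrative sketch; name and layout are assumptions.)"""
    mu = frame_outputs.mean(axis=0)                          # mean pooling (d-vector style)
    centered = frame_outputs - mu
    cov = centered.T @ centered / (len(frame_outputs) - 1)   # D x D sample covariance
    iu = np.triu_indices(cov.shape[0])                       # covariance is symmetric,
    return np.concatenate([mu, cov[iu]])                     # so keep the upper triangle only

# Example: 5 frames of a 3-dimensional hidden-layer output
frames = np.array([[0.1, 0.2, 0.3],
                   [0.2, 0.1, 0.4],
                   [0.0, 0.3, 0.2],
                   [0.3, 0.2, 0.5],
                   [0.1, 0.4, 0.1]])
emb = mean_cov_embedding(frames)
print(emb.shape)  # (9,) -> D + D*(D+1)/2 = 3 + 6
```

The embedding dimension grows quadratically in D, which is why keeping only the upper triangle of the symmetric covariance matrix is the natural choice when concatenating it with the mean.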
