LSTM Based End-to-End Text-Independent Speaker Verification Using Raw Waveform

机译：使用原始波形的基于LSTM的端到端文本无关的说话人验证

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Speaker can be discriminated either at voice source level or vocal tract system level. Conventionally Mel-Frequency Cesptral Coefficients (MFCCs) or Mel filterbank energies are employed as input acoustic feature in neural network based speaker verification systems. In this paper, we investigate the LSTM based speaker verification using raw waveform as input feature. The basic LSTM based SV model and the model with attention layer are trained and optimized on two datasets using raw waveform feature and Fbank feature respectively. And experimental results show that compared with the model trained using Fbank feature, the model trained using raw waveform can achieve promising performance, raw waveform is a competitive acoustic feature for LSTM based speaker verification.

机译：可以在语音源级别或声道系统级别区分说话者。常规地，在基于神经网络的说话者验证系统中，采用梅尔频率中枢系数（MFCC）或梅尔滤波器组能量作为输入声学特征。在本文中，我们研究了使用原始波形作为输入功能的基于LSTM的扬声器验证。基于LSTM的基本SV模型和带有关注层的模型分别使用原始波形特征和Fbank特征在两个数据集上进行了训练和优化。实验结果表明，与使用Fbank功能训练的模型相比，使用原始波形训练的模型可以实现有希望的性能，原始波形是基于LSTM的说话人验证的竞争声学功能。

著录项

来源
《International Conference on Culture-oriented Science and Technology》|2020年|500-503|共4页
会议地点
作者
Jing He; Pengwei Zhang; Liangjin Zhu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Feature extraction; Acoustics; Neural networks; Conferences; Computational modeling; Chemical reactors; Convolution;

机译：特征提取;声学;神经网络;会议;计算模型;化学反应器;卷积;
入库时间 2022-08-26 14:36:13

相似文献

外文文献
中文文献
专利

1. A novel text-independent speaker verification method based on the global speaker model [J] . Yiying Zhang, Zhang D. IEEE transactions on systems, man, and cybernetics. Part A . 2000,第5期

机译：基于全局说话人模型的文本无关说话人验证方法
2. Correction to a novel text-independent speaker verification method based on the global speaker model [J] . Zhang Y., Zhu X. IEEE transactions on systems, man, and cybernetics. Part A . 2000,第6期

机译：基于全局说话人模型的新型与文本无关的说话人验证方法的校正
3. End-to-end DNN based text-independent speaker recognition for long and short utterances [J] . Rohdin Johan, Silnova Anna, Diez Mireia, Computer speech and language . 2020,第Jana期

机译：基于端到端DNN的，与文本无关的说话人识别，可实现长话和短话
4. End-to-End Feature Learning for Text-Independent Speaker Verification [C] . Fangzhou Chen, Tengyue Bian, Li Xu Chinese Control and Decision Conference . 2019

机译：端到端特征学习，用于独立于文本的说话者验证
5. Speaker adaptation in joint factor analysis based text independent speaker verification [D] . Shou-Chun, Yin 2007

机译：基于联合因素分析的文本自适应说话人验证中的说话人适应
6. A novel end-to-end method to predict RNA secondary structure profile based on bidirectional LSTM and residual neural network [O] . Linyu Wang, Xiaodan Zhong, Shuo Wang, 2021

机译：一种新的端到端方法以预测基于双向LSTM和残差神经网络的RNA二级结构谱
7. RawNet: Advanced End-to-End Deep Neural Network Using Raw Waveforms for Text-Independent Speaker Verification [O] . Jee-weon Jung, Hee-Soo Heo, Ju-ho Kim, 2019

机译：RAWENT：使用原始波形的先进端到端深神经网络用于独立于文本的扬声器验证

LSTM Based End-to-End Text-Independent Speaker Verification Using Raw Waveform

摘要

著录项

相似文献

相关主题

期刊订阅