Limited data speaker identification

H S Jayanna; S R Mahadeva Prasanna

Abstract

This paper addresses the task of identifying a speaker from limited training and testing data. A speaker identification system is viewed as four stages, namely analysis, feature extraction, modelling and testing, and its performance depends on the techniques employed in each stage. Experiments show that, when training and testing data are limited, the existing techniques in each stage do not perform well because too little data is available. This work demonstrates the following: multiple frame size and rate (MFSR) analysis improves the analysis stage; a combination of mel frequency cepstral coefficients (MFCC), their temporal derivatives $(\Delta, \Delta\Delta)$, linear prediction residual (LPR) and linear prediction residual phase (LPRP) features improves the feature extraction stage; and a combination of learning vector quantization (LVQ) and the Gaussian mixture model-universal background model (GMM-UBM) improves the modelling stage. Performance improves further when the proposed techniques are integrated at their respective stages and the evidence from them is combined at the testing stage. To achieve this, we propose strength voting (SV), weighted Borda count (WBC) and supporting systems (SS) as combining methods at the abstract, rank and measurement levels, respectively. Finally, the proposed hierarchical combination (HC) method, which integrates these three methods, provides a significant improvement in performance. Based on these explorations, this work proposes a scheme for speaker identification under limited training and testing data.
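The abstract names weighted Borda count as the rank-level combining method but does not give its formulation. The sketch below shows the generic textbook form of a weighted Borda count, not necessarily the authors' variant: each classifier ranks the candidate speakers, a speaker at position p in a list of n earns n - 1 - p points, and the points are scaled by a per-classifier weight before being summed. The function name, the speaker labels and the weight values are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def weighted_borda_count(rankings, weights):
    """Fuse ranked speaker lists from several classifiers.

    rankings: list of lists, each ordered best-first over the same speakers.
    weights:  one non-negative weight per classifier (illustrative values,
              not taken from the paper).
    Returns the speakers sorted by total weighted Borda score, best first.
    """
    if len(rankings) != len(weights):
        raise ValueError("need exactly one weight per ranking")
    scores = defaultdict(float)
    for ranking, weight in zip(rankings, weights):
        n = len(ranking)
        for position, speaker in enumerate(ranking):
            # The top rank earns n - 1 points, the last rank earns 0.
            scores[speaker] += weight * (n - 1 - position)
    # Sort by descending score; break ties by label so the result is stable.
    return sorted(scores, key=lambda s: (-scores[s], s))


# Hypothetical rankings from three classifiers over three enrolled speakers.
rankings = [
    ["spk_a", "spk_b", "spk_c"],
    ["spk_b", "spk_a", "spk_c"],
    ["spk_b", "spk_c", "spk_a"],
]
weights = [0.5, 0.3, 0.2]  # assumed classifier weights
fused = weighted_borda_count(rankings, weights)
print(fused)
```

With these assumed inputs, spk_b wins even though the most heavily weighted classifier prefers spk_a, because the other two classifiers both rank spk_b first. How the weights are chosen in practice (for example, from each classifier's accuracy on held-out data) is left open here.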


Bibliographic record

Source: Sadhana, 2010, Issue 5, 22 pages
Authors: H S Jayanna; S R Mahadeva Prasanna
Original format: PDF
CLC classification: General industrial technology

