Audio-Visual Based Multi-Sample Fusion to Enhance Correlation Filters Speaker Verification System

Dzati Athiar Ramli; Salina Abdul Samad; Aini Hussain

首页> 外文期刊>International Journal on Computer Science and Engineering >Audio-Visual Based Multi-Sample Fusion to Enhance Correlation Filters Speaker Verification System

【24h】

Audio-Visual Based Multi-Sample Fusion to Enhance Correlation Filters Speaker Verification System

机译：基于视听的多样本融合以增强相关滤波器说话者验证系统

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this study, we propose a novel approach for speaker verification system that uses a spectrogram image as features and Unconstrained Minimum Average Correlation Energy (UMACE) filters as classifiers. Since speech signal is a behavioral signal, the speech data has a tendency not to consistently reproduce due to the change of speaking rates, health, emotional conditions, temperature and humidity. In order to overcome this problem, a modification of UMACE filters architecture is proposed by executing a multi-sample fusion using speech and lipreading data. So as to evaluate the outstanding fusion scheme, five multi-sample fusion strategies, i.e. maximum, minimum, median, average and majority vote are first experimented using the speech signal data. Afterward, the performance of the audio-visual system using the enhanced UMACE filters is then tested. Here, lipreading data is combined to the audio samples pool and the outstanding fusion scheme that found in prior experiment is used as multi-sample fusion scheme. The Digit Database had been used for performance evaluation and the performance up to 99.64% is achieved by using the enhanced UMACE filters for the speech only system which is 6.89% improvement compared with the base line approach. Subsequently, the implementation of the audio-visual system is observed to be significant in order to broaden the PSR score interval between the authentic and imposter data as well as to further improve the performance of audio only system that offer toward a robust verification system.

机译：在这项研究中，我们为说话者验证系统提出了一种新颖的方法，该方法使用频谱图图像作为特征，并使用无约束最小平均相关能量（UMACE）滤波器作为分类器。由于语音信号是行为信号，因此语音数据由于语音速率，健康状况，情绪状况，温度和湿度的变化而具有不一致地再现的趋势。为了克服该问题，提出了通过使用语音和唇读数据执行多样本融合来对UMACE滤波器架构进行修改的提议。为了评估杰出的融合方案，首先使用语音信号数据对五个多样本融合策略进行了试验，即最大，最小，中位数，平均和多数投票。然后，然后使用增强的UMACE滤波器测试视听系统的性能。在这里，将唇读数据组合到音频样本池中，并且将先前实验中发现的出色融合方案用作多样本融合方案。 Digit数据库已用于性能评估，通过对语音系统使用增强的UMACE滤波器，可以将性能提高到99.64％，比基线方法提高了6.89％。随后，为了扩大真实数据和冒名顶替者数据之间的PSR分数间隔，并进一步提高仅提供音频的系统的性能，可以观察到视听系统的实现意义重大，从而可以提供一种可靠的验证系统。

著录项

来源
《International Journal on Computer Science and Engineering》 |2010年第4期|共9页
作者
Dzati Athiar Ramli; Salina Abdul Samad; Aini Hussain;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. QUALITY BASED SPEAKER VERIFICATION SYSTEMS USING FUZZY INFERENCE FUSION SCHEME [J] . Lydia Abdul Hamid, Dzati Athiar Ramli Journal of computer sciences . 2014,第3期

机译：基于模糊推理融合方案的基于质量的说话人验证系统
2. QUALITY BASED SPEAKER VERIFICATION SYSTEMS USING FUZZY INFERENCE FUSION SCHEME | Science Publications [J] . Dzati Athiar Ramli, Lydia Abdul Hamid Journal of computer sciences . 2013,第3期

机译：模糊推理融合方案的基于质量的说话人验证系统科学出版物
3. Speaker Recognition System Based on the Baseband Correlation Score Reliability Fusion [J] . Qi He, Ting Huang, Hongbo Zhang Communications and Network . 2013,第3期

机译：基于基带相关评分可靠性融合的扬声器识别系统
4. A Comparative Study on Multi-Sample Fusion Schemes to Enhance Spectrographic Speaker Verification [C] . SALINA ABDUL SAMAD, DZATI ATHIAR RAMLI, AINI HUSSAIN WSEAS International Conferences . 2007

机译：多样化融合方案提高光谱扬声器验证的比较研究
5. Robust speech processing based on microphone array, audio-visual, and frame selection for in-vehicle speech recognition and in-set speaker recognition. [D] . Zhang, Xianxian. 2005

机译：基于麦克风阵列，视听和帧选择的强大语音处理功能，可实现车载语音识别和内置说话人识别。
6. Speaker verification based on the fusion of speech acoustics and inverted articulatory signals [O] . Ming Li, Jangwon Kim, Adam Lammert, -1

机译：基于语音声学和反向发音信号融合的说话人验证
7. QUALITY BASED SPEAKER VERIFICATION SYSTEMS USING FUZZY INFERENCE FUSION SCHEME [O] . Lydia Abdul Hamid, Dzati Athiar Ramli 2015

机译：基于模糊推理融合方案的基于质量的语音验证系统

Audio-Visual Based Multi-Sample Fusion to Enhance Correlation Filters Speaker Verification System

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅