首页> 外文期刊>Journal of computer sciences >QUALITY BASED SPEAKER VERIFICATION SYSTEMS USING FUZZY INFERENCE FUSION SCHEME
【24h】

QUALITY BASED SPEAKER VERIFICATION SYSTEMS USING FUZZY INFERENCE FUSION SCHEME

机译:基于模糊推理融合方案的基于质量的说话人验证系统

获取原文
获取原文并翻译 | 示例
       

摘要

Performances of single biometric speaker verification systems are outstanding in clean condition but drop significantly in noisy condition. Implementation of multibiometric systems is one of the solutions to this limitation. However, in order to ensure the performances of multibiometric systems are sustained, the optimum weight for the fusion system must be determined correctly according to the quality of current data. This study proposes the use of Fuzzy Inference System for weight inference. Two traits i.e., speech and lip are used while Support Vector Machine (SVM) is employed as the classifier in this study. The speech features are extracted using the Mel Frequency Cepstrum Coefficient (MFCC) method and the lip features are extracted using Region of Interest (ROI) method. The performances of single modal system (i.e., speech and lip) and multibiometric systems with sugeno and mamdani approaches are compared at different quality conditions in this study. Experimental results prove that the use of Fuzzy Inference System as weight inference is a very promising approach. For 15 dB SNR speech signal and 0.2 lip quality density, the GAR performances at FAR equals 0.1% for Mamdani-type, Sugeno-type, lip and speech systems are observed as 94, 95, 86 and 7%, respectively. In short, the proposed fusion scheme based on Fuzzy logic is able to maintain the performance of fusion system especially when one of the biometric sources is in noisy condition due to its capability to infer the correct fusion weight according to current data quality.
机译:单个生物特征说话人验证系统在清洁条件下的性能出色,但在嘈杂条件下的性能却明显下降。多生物学系统的实现是解决此限制的方法之一。但是,为了确保多生物系统的性能得以维持,必须根据当前数据的质量正确确定融合系统的最佳权重。本研究提出使用模糊推理系统进行权重推理。在本研究中,使用支持向量机(SVM)作为分类器时使用了语音和嘴唇这两个特征。使用梅尔频率倒谱系数(MFCC)方法提取语音特征,并使用感兴趣区域(ROI)方法提取嘴唇特征。在本研究中,比较了使用sugeno和mamdani方法的单模态系统(即语音和口语)和多生物系统的性能。实验结果证明,将模糊推理系统作为权重推理是一种很有前途的方法。对于15 dB SNR语音信号和0.2唇质量密度,对于Mamdani型,Sugeno型,唇和语音系统,FAR的GAR性能分别为0.1%,94%,95%,86%和7%。简而言之,所提出的基于模糊逻辑的融合方案能够保持融合系统的性能,特别是当生物特征来源之一处于嘈杂状态时,因为其能够根据当前数据质量推断正确的融合权重。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号