首页> 外文期刊>International Journal of Applied Engineering Research >Speaker Recognition by Gaussian Filter Based Feature Extraction and Proposed Fuzzy Vector Quantization Modelling Technique
【24h】

Speaker Recognition by Gaussian Filter Based Feature Extraction and Proposed Fuzzy Vector Quantization Modelling Technique

机译:基于高斯滤波器的特征提取和提出模糊矢量量化建模技术的扬声器识别

获取原文
获取原文并翻译 | 示例
           

摘要

Automatic speaker-recognition systems (ASR) have developed as a vital method for recognition of a person in many applications like e-commerce as well as in common interactions, law enforcement, and forensics. The performance of automatic speaker recognition system depends on the duration of the voice of test and train samples. In this paper, I address the issues in the design of codebook for outlier effect using the fuzzy vector quantization based algorithm for tasks such as speaker recognition for short voice samples. Particularly, I address the issues in partitioning the voice data for matching sequences of speaker specific feature vectors extracted from the voice signal data of utterances of a speaker. The design of codebook based on fuzzy vector quantization (FVQ) and fuzzy c-means (FCM) vector quantization has been proposed previously for matching the test and train voice signal characterized as sets of speaker-specific features vectors for tasks such as distortion measure and speaker recognition. Speaker-specific features Mel Frequency Cepstral Coefficients (MFCC) extracted by the traditional triangular filter as well as Gaussian and Tukey shapes filter. This research paper compares experimental results of three different modelling techniques namely, Fuzzy c-means, Fuzzy Vector Quantization2 (FVQ2) and proposed fuzzy vector quantization (FVQ). ASR efficiency of FVQ shows significant improvement in performance compared to FCM and FVQ2. The efficiency of proposed ASR system is 98.8% for 2 seconds of training voice data for a set of 100 speakers taken from the Texas Instruments and Massachusetts Institute of Technology (TIMIT) database.
机译:自动扬声器识别系统(ASR)已开发为识别许多应用中的人的重要方法,如电子商务,以及共同的互动,执法和取证。自动扬声器识别系统的性能取决于测试和列车样本的持续时间。在本文中,我使用基于模糊矢量量化的基于任务的算法来解决码本的码表效果设计中的问题,例如短语样本的扬声器识别。特别地,我解决了从扬声器的话语的语音信号数据提取的扬声器特定特征向量匹配匹配序列的语音数据的问题。先前提出了基于模糊矢量量化(FVQ)和模糊C-MEARACH(FCM)矢量量化的码本的设计,用于匹配测试和列车语音信号,其特征在于扬声器特征载体组,用于诸如失真测量的任务和扬声器识别。由传统三角形滤波器提取的扬声器特定特征MEL频率谱系系数(MFCC)以及高斯和TUKEY形状过滤器。本研究论文将三种不同的建模技术的实验结果进行了比较,模糊C型均值,模糊矢量量化2(FVQ2)和提出的模糊矢量量化(FVQ)。与FCM和FVQ2相比,FVQ的ASR效率显示出性能的显着改善。拟议ASR系统的效率为2秒的培训语音数据为一组100个扬声器,从德州仪器和马萨诸塞州理工学院(Timit)数据库中拍摄了一组100扬声器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号