首页> 外文期刊>Asian Journal of Pharmaceutical and Clinical Research >VOICE RECOGNITION SECURITY SYSTEM USING MEL FREQUENCY CEPSTRUM COEFFICIENTS
【24h】

VOICE RECOGNITION SECURITY SYSTEM USING MEL FREQUENCY CEPSTRUM COEFFICIENTS

机译:使用MEL倒谱系数的语音识别安全系统

获取原文
获取外文期刊封面目录资料

摘要

Voice Recognition is a fascinating field spanning several areas of computer science and mathematics. Reliable speaker recognition is a hard problem, requiring a combination of many techniques; however modern methods have been able to achieve an impressive degree of accuracy. This project attempts to examine those techniques, and to apply them to build a simple voice recognition system. The project is implemented on software which uses different techniques such as Mel frequency Cepstrum Coefficient (MFCC), Vector Quantization (VQ) which are implemented using MATLAB. MFCC is used to extract the characteristics from the input speech signal with respect to a particular word uttered by a particular speaker. VQ codebook is generated by clustering the training feature vectors of each speaker and then stored in the speaker database. Verification of the speaker is carried out using Euclidian Distance. For voice recognition we implement the MFCC approach using software platform MatlabR2013b.
机译:语音识别是一个引人入胜的领域,涵盖了计算机科学和数学的多个领域。可靠的说话人识别是一个难题,需要多种技术的结合。但是,现代方法已经能够实现令人印象深刻的准确性。该项目尝试检查这些技术,并将其应用于构建简单的语音识别系统。该项目在使用不同技术的软件上实现,例如使用MATLAB实现的梅尔频率倒谱系数(MFCC),矢量量化(VQ)。 MFCC用于从输入语音信号中提取特定说话者发出的特定单词的特征。通过对每个说话者的训练特征向量进行聚类来生成VQ码本,然后将其存储在说话者数据库中。使用欧几里得距离对说话者进行验证。对于语音识别,我们使用软件平台MatlabR2013b实现MFCC方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号