首页> 外文会议>2015 International Conference on Soft Computing Techniques and Implementations >Speaker recognition using MFCC, shifted MFCC with vector quantization and fuzzy
【24h】

Speaker recognition using MFCC, shifted MFCC with vector quantization and fuzzy

机译:使用MFCC的说话人识别,带矢量量化的模糊MFCC和模糊

获取原文
获取原文并翻译 | 示例

摘要

In the range of biometric we consider the variability of discourse flag because of the vicinity of clam or which impressively corrupts the productivity of ASR in genuine ecological condition. Speaker-vocal attributes exist in discourse signals and because of distinctive resonances of diverse speakers speaker acknowledgment framework checks the speaker. These distinctions can be misused by extricating element vectors like Mel-Frequency Cepstral Coefficient (MFCCs) from the discourse signal. In this paper we have utilized MFCC and Shifted MFCC with Vector Quantization and fuzzy demonstrating strategies correspondingly to enhance the execution of ASR even in boisterous environment with the assistance of redesigned discourse data which are available at high recurrence in otherworldly area. The mix of fuzzy demonstrating and shifted MFCC makes an in number total calculation which has the sensibly high vigour to clamour. In exploratory results, we have discovered 10???20% upgraded precision even at 5???8dB SNR in the vicinity of music foundation, boisterous natural condition furthermore in the vicinity of repetitive sound.
机译:在生物特征识别的范围内,我们考虑到由于靠近蛤or而语篇标记的可变性,或者在真实的生态条件下会明显破坏ASR的生产率。话语信号中存在说话者-声音属性,并且由于不同说话者的独特共鸣,说话者确认框架会检查说话者。通过从话语信号中提取诸如梅尔频率倒谱系数(MFCC)之类的元素向量,可能会误用这些区别。在本文中,我们借助向量量化和模糊演示策略分别利用了MFCC和Shifted MFCC,即使是在喧闹的环境中,也可以借助重新设计的话语数据来提高ASR的执行能力,这些话语数据在其他世界中经常出现。模糊演示和移位MFCC的混合进行了总数总计计算,具有很高的活力。在探索性结果中,我们发现即使在音乐基础附近,即使在5×8dB SNR的情况下,也能提高10×20%的精度,此外在重复声音附近,还具有繁华的自然条件。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号