首页> 外文期刊>International journal of speech technology >Time Scale Modification And Vocal Tract Length Normalization For Improving The Performance Of Tamil Speech Recognition System Implemented Using Language Independent Segmentation Algorithm
【24h】

Time Scale Modification And Vocal Tract Length Normalization For Improving The Performance Of Tamil Speech Recognition System Implemented Using Language Independent Segmentation Algorithm

机译:时标修改和声道长度归一化以提高使用语言独立分割算法实现的泰米尔语语音识别系统的性能

获取原文
获取原文并翻译 | 示例
       

摘要

This paper describes the work done in improving the performance of Tamil speech recognition system by using Time Scale Modification (TSM) and Vocal Tract Length Normalization (VTLN) techniques. The speech recognition system for Tamil language was developed using a new approach of text independent speech segmentation, with a phoneme based language model for recognition. There is degradation in the performance of speech recognition due to variations in the speaking rate and vocal tract shape among different speakers. In order to improve the performance of speech recognition system, both TSM and VTLN normalization techniques were used in this work. The TSM was implemented using the Phase vocoder approach and the VTLN was implemented using speaker specific bark/mel scale in bark/mel domain. The performance of Tamil speech recognition system was improved by performing both TSM and VTLN normalization techniques.
机译:本文介绍了通过使用时标修改(TSM)和人声道长度归一化(VTLN)技术来提高泰米尔语语音识别系统性能的工作。泰米尔语的语音识别系统是使用一种新的独立于文本的语音分割方法开发的,并具有基于音素的语言模型进行识别。由于不同说话者之间的语速和声道形状的变化,语音识别性能会下降。为了提高语音识别系统的性能,这项工作中同时使用了TSM和VTLN归一化技术。 TSM是使用相位声码器方法实现的,而VTLN是使用树皮/梅尔域中特定于说话者的树皮/梅尔音阶来实现的。通过执行TSM和VTLN归一化技术,可以改善泰米尔语语音识别系统的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号