首页> 外文会议>IEEE International Conference on Signal Processing, Informatics, Communication and Energy Systems >Significance of speech enhancement and sonorant regions of speech for robust language identification
【24h】

Significance of speech enhancement and sonorant regions of speech for robust language identification

机译:语音增强和言语言论的意义稳健语言识别

获取原文

摘要

A high degree of robustness is a prerequisite to operate speech and language processing systems in practical environments. Performance of these systems is highly influenced by varying and mixed background environments. In this paper, we put forward a robust method for automatic language identification in various background environments. Combined temporal and spectral processing method is used as a preprocessing technique for enhancing the degraded speech. Language discriminative information in high sonority regions of speech is used for the task of language identification. Sonority regions are regions of speech whose signal energy is high and these regions are less influenced by background environments. Spectral energy of formants in the glottal closure regions is employed as an acoustic correlate for the detection of sonority regions of speech. In this paper performance of the LID system is studied in various background environments like clean room, car factory, high frequency, pink and white noise environments. In this work, Indian Institute of Technology Kharagpur - Multi Lingual Indian Language Speech Corpus (IITKGP-MLILSC) is used for building language identification system. Noise speech samples from the NOISEX database are employed in the present study. The performance of the proposed method is quite satisfactory compared to existing approaches.
机译:高度的鲁棒性是在实际环境中操作语音和语言处理系统的先决条件。这些系统的性能受到不同和混合背景环境的影响。在本文中,我们提出了一种在各种背景环境中的自动语言识别的鲁棒方法。组合的时间和光谱处理方法用作增强劣化语音的预处理技术。语言中的语言歧视信息语音中的语言区域用于语言识别的任务。 Sonority地区是信号能量高,这些区域受到背景环境影响的地区。光学封闭区域中的中常体的光谱能量被用作声学相关性的声学相关性。在本文的纸张性能下,在各种背景环境中研究了洁净室,汽车厂,高频,粉红色和白噪声环境。在这项工作中,印度理工学院Kharagpur - 多语言印度语言语音语料库(IITKGP-MLILSC)用于构建语言识别系统。来自诊断数据库的噪声语音在本研究中采用。与现有方法相比,所提出的方法的性能非常令人满意。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号