Significance of speech enhancement and sonorant regions of speech for robust language identification


Abstract

A high degree of robustness is a prerequisite for operating speech and language processing systems in practical environments. The performance of these systems is strongly influenced by varying and mixed background environments. In this paper, we put forward a robust method for automatic language identification (LID) in various background environments. A combined temporal and spectral processing method is used as a preprocessing technique to enhance the degraded speech. Language-discriminative information in the high-sonority regions of speech is then used for language identification. Sonority regions are regions of speech whose signal energy is high, and they are less affected by background environments. The spectral energy of formants in the glottal closure regions is employed as an acoustic correlate for detecting the sonority regions of speech. The performance of the LID system is studied in various background environments: clean room, car factory, high frequency, pink noise, and white noise. The Indian Institute of Technology Kharagpur - Multi Lingual Indian Language Speech Corpus (IITKGP-MLILSC) is used to build the language identification system, and noise samples from the NOISEX database are employed in the study. The performance of the proposed method is quite satisfactory compared to existing approaches.
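The abstract describes two ideas that a short sketch can make concrete: degrading clean speech with background noise, and keeping only the high-energy regions of the signal. The Python below is a minimal illustration under stated assumptions, not the authors' method. The paper detects sonority regions from the spectral energy of formants in glottal closure regions; this sketch substitutes plain short-time energy as a crude proxy. The abstract does not say how noise was added, so mixing at a target signal-to-noise ratio is an assumption, and the function names, frame sizes, and `keep_fraction` parameter are all hypothetical.

```python
import math


def mix_at_snr(speech, noise, snr_db):
    """Add noise to speech, scaled so the mix has the requested SNR in dB.

    Assumption: the abstract does not state how noise was mixed in.
    """
    if len(noise) < len(speech):
        raise ValueError("noise must be at least as long as speech")
    noise = noise[:len(speech)]
    speech_power = sum(s * s for s in speech) / len(speech)
    noise_power = sum(n * n for n in noise) / len(noise)
    # Choose the gain so that speech_power / (gain^2 * noise_power) = 10^(snr_db / 10)
    gain = math.sqrt(speech_power / (noise_power * 10 ** (snr_db / 10)))
    return [s + gain * n for s, n in zip(speech, noise)]


def frame_energies(signal, frame_len, hop):
    """Short-time energy (sum of squared samples) of each full frame."""
    return [
        sum(x * x for x in signal[start:start + frame_len])
        for start in range(0, len(signal) - frame_len + 1, hop)
    ]


def high_energy_frames(signal, frame_len=400, hop=160, keep_fraction=0.3):
    """Indices of the highest-energy frames, returned in time order.

    Short-time energy is a stand-in here, not the formant-based
    detector described in the abstract.
    """
    energies = frame_energies(signal, frame_len, hop)
    n_keep = max(1, round(keep_fraction * len(energies)))
    ranked = sorted(range(len(energies)), key=lambda i: energies[i], reverse=True)
    return sorted(ranked[:n_keep])
```

In a pipeline of this shape, features for language identification would be extracted only from the frames returned by `high_energy_frames`, on the premise stated in the abstract that high-energy regions are less affected by the background.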


Bibliographic details

Source
IEEE International Conference on Signal Processing, Informatics, Communication and Energy Systems | 2015 | 5 pages
Authors
Kumar Vuppala Anil; Mounika K.V.; Vydana Hari Krishna
Original format: PDF
Chinese Library Classification: TN911.7-53
Keywords
Automatic language identification; combined temporal and spectral processing; formant frequencies; glottal closure region; sonority regions; various background environments


Similar literature

1. Enhancing Comprehension of Lecture Content in a Foreign Language as the Medium of Instruction: Comparing Speech-to-Text Recognition With Speech-Enabled Language Translation [J]. Rustam Shadiev, Yu-Cheng Chien, Yueh-Min Huang. SAGE Open. 2020, No. 3
2. Subjective and Objective Analysis of Speech Enhancement Algorithms for Single Channel Speech Patterns of Indian and English Languages [J]. Sachin Singh, Manoj Tripathy, R. S. Anand. IETE Technical Review. 2014, No. 1
3. Combination of GMM-Based Speech Estimation Method and Temporal Domain SVD-Based Speech Enhancement for Noise Robust Speech Recognition [J]. Masakiyo Fujimoto, Yasuo Ariki. Systems and Computers in Japan. 2007, No. 3
4. Significance of speech enhancement and sonorant regions of speech for robust language identification [C]. Kumar Vuppala Anil, Mounika K.V., Vydana Hari Krishna. IEEE International Conference on Signal Processing, Informatics, Communication and Energy Systems. 2015
5. Speech enhancement for robust speech communication [D]. Deng, Ying. 2006
6. Cued Speech for Enhancing Speech Perception and First Language Development of Children With Cochlear Implants [O]. Jacqueline Leybaert, Carol J. LaSasso. 2010
7. Robust Automatic Continuous Speech Segmentation for Indian Languages to Improve Speech to Speech Translation [O]. J. Sangeetha, S. Jothilakshmi. 2013
8. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment [R]. Hansen, J. H. 2015
