SYLLABLE-BASED AUTOMATIC ARABIC SPEECH RECOGNITION IN NOISY-TELEPHONE CHANNEL

MOHAMED MOSTAFA AZMI; HESHAM TOLBA; SHERIF MAHDY; MERVAT FASHAL

首页> 外文期刊>WSEAS Transactions on Signal Processing >SYLLABLE-BASED AUTOMATIC ARABIC SPEECH RECOGNITION IN NOISY-TELEPHONE CHANNEL

【24h】

SYLLABLE-BASED AUTOMATIC ARABIC SPEECH RECOGNITION IN NOISY-TELEPHONE CHANNEL

机译：语音电话中基于节的自动阿拉伯语音识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The performance of well-trained speech recognizers using high quality full bandwidth speech data is usually degraded when used in real world environments. In particular, telephone speech recognition is extremely difficult due to the limited bandwidth of transmission channels. In this paper, we concentrate on the telephone recognition of Egyptian Arabic speech using syllables. Arabic spoken digits were described by showing their constructing phonemes, triphones, syllables and words. Speaker-independent hidden Markov models (HMMs)-based speech recognition system was designed using Hidden Markov model toolkit (HTK). The database used for both training and testing consists from forty-four Egyptian speakers. In clean environment, experiments show that the recognition rate using syllables outperformed the rate obtained using monophones, triphones and words by 2.68%, 1.19% and 1.79% respectively. Also in noisy telephone channel, syllables outperformed the rate obtained using monophones, triphones and words by 2.09%, 1.5% and 0.9% respectively. Comparative experiments have indicated that the use of syllables as acoustic units leads to an improvement in the recognition performance of HMM-based ASR systems in noisy environments. A syllable unit spans a longer time frame, typically three phones, thereby offering a more parsimonious framework for modeling pronunciation variation in spontaneous speech. Moreover, syllable-based recognition has relatively smaller number of used units and runs faster than word-based recognition.

机译：当在现实环境中使用时，使用高质量全带宽语音数据的训练有素的语音识别器的性能通常会降低。特别地，由于传输信道的带宽有限，电话语音识别非常困难。在本文中，我们重点研究使用音节对埃及阿拉伯语音的电话识别。阿拉伯语的语音数字通过显示其构成音素，三音，音节和单词来描述。使用隐马尔可夫模型工具包（HTK）设计了基于说话者无关的隐马尔可夫模型（HMM）的语音识别系统。用于培训和测试的数据库由四十四名埃及讲者组成。在干净的环境中，实验表明，使用音节的识别率分别比单音，三音和单词的识别率分别高2.68％，1.19％和1.79％。同样在嘈杂的电话频道中，音节分别比使用单音，三音和单词获得的音高2.09％，1.5％和0.9％。比较实验表明，将音节用作声学单位会导致在嘈杂环境中基于HMM的ASR系统的识别性能得到改善。一个音节单元跨越一个较长的时间范围，通常是三个电话，从而为模拟自发语音中的发音变化提供了更为简洁的框架。此外，基于音节的识别具有相对较少的使用单位数量，并且比基于单词的识别运行得更快。

著录项

来源
《WSEAS Transactions on Signal Processing》 |2008年第4期|共10页
作者
MOHAMED MOSTAFA AZMI; HESHAM TOLBA; SHERIF MAHDY; MERVAT FASHAL;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类声学;
关键词
Speech recognition; Syllables; Arabic language; HMMs; Noisy-channel;

机译：语音识别音节阿拉伯语HMM噪声通道;

相似文献

外文文献
中文文献
专利

1. SYLLABLE-BASED AUTOMATIC ARABIC SPEECH RECOGNITION IN NOISY-TELEPHONE CHANNEL [J] . MOHAMED MOSTAFA AZMI, HESHAM TOLBA, SHERIF MAHDY, WSEAS Transactions on Signal Processing . 2008,第4期

机译：语音电话中基于节的自动阿拉伯语音识别
2. Arabic Speaker-Independent Continuous Automatic Speech Recognition Based on a Phonetically Rich and Balanced Speech Corpus [J] . Mohammad Abushariah, Raja Ainon, Roziati Zainuddin, The international arab journal of information technology . 2012,第1期

机译：基于语音丰富均衡的语料库的阿拉伯语独立于说话人的连续自动语音识别
3. Modern standard Arabic speech corpus for implementing and evaluating automatic continuous speech recognition systems [J] . Mohammad Abd-Alrahman Mahmoud Abushariah, Raja Noor Ainon, Roziati Zainuddin, Journal of the Franklin Institute . 2012,第7期

机译：用于实现和评估自动连续语音识别系统的现代标准阿拉伯语语音语料库
4. Syllable-Based Automatic Arabic Speech Recognition in Noisy Enviroment [C] . Mohamed M.Azmi, Hesham Tolba 2008 International Conference on Audio，Language and Image Processing（2008国际声音、语言、图像过程大会）论文集 . 2008

机译：嘈杂环境中基于音节的自动阿拉伯语语音识别
5. Arabic language modeling with stem-derived morphemes for automatic speech recognition. [D] . Heintz, Ilana. 2010

机译：具有词干衍生语素的阿拉伯语言建模，可实现自动语音识别。
6. Formant analysis in dysphonic patients and automatic Arabic digit speech recognition [O] . Ghulam Muhammad, Tamer A Mesallam, Khalid H Malki, 2011

机译：语音障碍患者的共振峰分析和阿拉伯数字自动语音识别
7. The effects of speakers' gender, age, and region on overall performance of Arabic automatic speech recognition systems using the phonetically rich and balanced Modern Standard Arabic speech corpus [O] . Sawalha M, Abu Shariah M 2013

机译：发言者的性别，年龄和地区对使用语音丰富和平衡的现代标准阿拉伯语言语料库的阿拉伯语自动语音识别系统整体表现的影响

SYLLABLE-BASED AUTOMATIC ARABIC SPEECH RECOGNITION IN NOISY-TELEPHONE CHANNEL

摘要

著录项

相似文献

相关主题

期刊订阅