TAMEEM V1.0: speakers and text independent Arabic automatic continuous speech recognizer

Mohammad A. M. Abushariah

首页> 外文期刊>International journal of speech technology >TAMEEM V1.0: speakers and text independent Arabic automatic continuous speech recognizer

【24h】

TAMEEM V1.0: speakers and text independent Arabic automatic continuous speech recognizer

机译：TAMEEM V1.0：独立于扬声器和文本的阿拉伯语自动连续语音识别器

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This research work aims to disseminate the efforts towards developing and evaluating TAMEEM Vl.O, which is a state-of-the-art pure Modern Standard Arabic (MSA), automatic, continuous, speaker independent, and text independent speech recognizer using high proportion of the spoken data of the phonetically rich and balanced MSA speech corpus. The speech corpus contains speech recordings of Arabic native speakers from 11 Arab countries representing Levant, Gulf, and Africa regions of the Arabic World, which make about 45.30 h of speech data. The recordings contain about 39.28 h of 367 sentences that are considered phonetically rich and balanced, which are used for training TAMEEM V1.0 speech recognizer, and another 6.02 h of another 48 sentences that are used for testing purposes, which are mostly text independent and foreign to the training sentences. TAMEEM V1.0 speech recognizer is developed using the Carnegie Mellon University (CMU) Sphinx 3 tools in order to evaluate the speech corpus, whereby the speech engine uses three-emitting state Continuous Density Hidden Markov Model for tri-phone based acoustic models, and the language model contains uni-grams, bi-grams, and tri-grams. Using three different testing data sets, this work obtained 7.64% average Word Error Rate (WER) for speakers dependent with text independent data set. For speakers independent with text dependent data set, this work obtained 2.22% average WER, whereas 7.82% average WER is achieved for speakers independent with text independent data set.

机译：这项研究工作旨在传播开发和评估TAMEEM Vl.O的努力，TAMEEM Vl.O是最先进的纯现代标准阿拉伯语（MSA），自动，连续，独立于说话者和文本独立的语音识别器，使用高比例语音丰富且平衡的MSA语音语料库的语音数据。语音语料库包含来自11个阿拉伯国家（代表阿拉伯世界的黎凡特，海湾和非洲地区）的阿拉伯语母语人士的语音记录，这些语音记录约占45.30小时。录音包含约367个句子中的39.28小时，这些句子被认为在语音上丰富且均衡，用于训练TAMEEM V1.0语音识别器；另外还有6.02 h，另外48个句子用于测试目的，这些句子大部分与文本无关，外来训练的句子。 TAMEEM V1.0语音识别器是使用卡内基梅隆大学（CMU）的Sphinx 3工具开发的，用于评估语音语料库，其中语音引擎将三发射状态连续密度隐藏马尔可夫模型用于基于三电话的声学模型，并且语言模型包含单字组，二元组和三元组。使用三个不同的测试数据集，这项工作获得了依赖文本独立数据集的说话人的平均单词错误率（WER）为7.64％。对于独立于文本依赖数据集的说话者，这项工作获得了2.22％的平均WER，而独立于文本依赖数据集的说话者获得了7.82％的平均WER。

著录项

来源
《International journal of speech technology》 |2017年第2期|261-280|共20页
作者
Mohammad A. M. Abushariah;
展开▼
作者单位

Department of Computer Information Systems, King Abdullah II School for Information Technology, The University of Jordan, Amman, Jordan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Modern Standard Arabic; Text corpus; Speech corpus; Phonetically rich; Phonetically balanced; Automatic continuous speech recognition;

机译：现代标准阿拉伯语;文本语料库;语料库;语音丰富;语音平衡;自动连续语音识别;

相似文献

外文文献
中文文献
专利

1. Arabic Speaker-Independent Continuous Automatic Speech Recognition Based on a Phonetically Rich and Balanced Speech Corpus [J] . Mohammad Abushariah, Raja Ainon, Roziati Zainuddin, The international arab journal of information technology . 2012,第1期

机译：基于语音丰富均衡的语料库的阿拉伯语独立于说话人的连续自动语音识别
2. The benefit obtained from visually displayed text from an automatic speech recognizer during listening to speech presented in noise. [J] . Zekveld AA, Kramer SE, Kessens JM, Ear and hearing. . 2008,第6期

机译：从自动语音识别器以可视方式显示的文本在收听以噪声呈现的语音时获得的好处。
3. Modern standard Arabic speech corpus for implementing and evaluating automatic continuous speech recognition systems [J] . Mohammad Abd-Alrahman Mahmoud Abushariah, Raja Noor Ainon, Roziati Zainuddin, Journal of the Franklin Institute . 2012,第7期

机译：用于实现和评估自动连续语音识别系统的现代标准阿拉伯语语音语料库
4. Phonetically rich and balanced speech corpus for Arabic speaker-independent continuous automatic speech recognition systems [C] . Abushariah Mohammad A. M., Ainon Raja N., Zainuddin Roziati, 10th International Conference on Information Sciences Signal Processing and their Applications . 2010

机译：具有语音丰富且平衡的语音语料库，用于独立于阿拉伯语的连续自动语音识别系统
5. Real-time speaker -independent large vocabulary continuous speech recognition. [D] . Li, Xiaolong. 2005

机译：实时独立于说话者的大词汇量连续语音识别。
6. Towards understanding speaker discrimination abilities in humans and machines for text-independent short utterances of different speech styles [O] . Soo Jin Park, Gary Yeung, Neda Vesselinova, -1

机译：旨在理解人和机器中说话者的辨别能力以实现不同语音风格的与文本无关的简短发声
7. A Speaker Independent Continuous Speech Recognizer for Amharic [O] . Seid Hussien, Gambäck Björn 2005

机译：演讲者独立的阿姆哈拉语连续语音识别器

TAMEEM V1.0: speakers and text independent Arabic automatic continuous speech recognizer

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅