Modern standard Arabic speech corpus for implementing and evaluating automatic continuous speech recognition systems

Mohammad Abd-Alrahman Mahmoud Abushariah; Raja Noor Ainon; Roziati Zainuddin; Assal Ali Mustafa Alqudah; Moustafa Elshafei Ahmed; Othman Omran Khalifa

Abstract

This paper presents our work towards developing a new speech corpus for Modern Standard Arabic (MSA), which can be used for implementing and evaluating Arabic speaker-independent, large-vocabulary, automatic, continuous speech recognition systems. The speech corpus was recorded by 40 native Arabic speakers (20 male and 20 female) from 11 countries representing three major regions (Levant, Gulf, and Africa). Three development phases were conducted, based on the size of the training data, the Gaussian mixture distributions, and the tied states (senones). In the third development phase, which used 11 hours of training speech data, the acoustic model is composed of 16 Gaussian mixture distributions, with the state distributions tied to 300 senones. Using three different data sets, the third development phase obtained an average word recognition correctness rate of 94.32% and an average Word Error Rate (WER) of 8.10% for the same speakers with different sentences (testing sentences). For different speakers with the same sentences (training sentences), this work obtained an average word recognition correctness rate of 98.10% and an average WER of 2.67%, whereas for different speakers with different sentences (testing sentences) it obtained an average word recognition correctness rate of 93.73% and an average WER of 8.75%.
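The abstract reports two metrics per test condition without defining them. As an illustrative sketch only, the Python below assumes the common convention in which word correctness is (N - S - D) / N and WER is (S + D + I) / N, where N is the number of reference words and S, D, and I are the substitutions, deletions, and insertions found by a minimum-edit-distance alignment. These definitions, and the names `Scores` and `score`, are assumptions for illustration and are not taken from the paper.

```python
from typing import List, NamedTuple


class Scores(NamedTuple):
    substitutions: int
    deletions: int
    insertions: int
    ref_len: int  # number of words in the reference transcript (must be > 0)

    @property
    def correctness(self) -> float:
        # Assumed definition: percent of reference words recognized correctly.
        # Insertions are not penalized here.
        hits = self.ref_len - self.substitutions - self.deletions
        return 100.0 * hits / self.ref_len

    @property
    def wer(self) -> float:
        # Assumed definition: all three error types over the reference length.
        errors = self.substitutions + self.deletions + self.insertions
        return 100.0 * errors / self.ref_len


def score(reference: List[str], hypothesis: List[str]) -> Scores:
    """Align two word sequences by edit distance and count error types."""
    n, m = len(reference), len(hypothesis)
    # cost[i][j] = (edits, subs, dels, ins) for reference[:i] vs hypothesis[:j]
    cost = [[(0, 0, 0, 0)] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost[i][0] = (i, 0, i, 0)  # every reference word deleted
    for j in range(1, m + 1):
        cost[0][j] = (j, 0, 0, j)  # every hypothesis word inserted
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            e, s, d, ins = cost[i - 1][j - 1]
            if reference[i - 1] == hypothesis[j - 1]:
                diagonal = (e, s, d, ins)          # match, no cost
            else:
                diagonal = (e + 1, s + 1, d, ins)  # substitution
            e, s, d, ins = cost[i - 1][j]
            deletion = (e + 1, s, d + 1, ins)
            e, s, d, ins = cost[i][j - 1]
            insertion = (e + 1, s, d, ins + 1)
            cost[i][j] = min(diagonal, deletion, insertion, key=lambda t: t[0])
    _, s, d, ins = cost[n][m]
    return Scores(s, d, ins, n)


# Toy example with placeholder tokens: one substitution and one insertion.
result = score("a b c d".split(), "a x c d e".split())
print(result.correctness, result.wer)
```

Under these assumed definitions, correctness and WER need not sum to 100%, because insertions count against WER but not against correctness. That would be consistent with reported pairs such as 94.32% and 8.10%.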


Bibliographic details

Source
Journal of the Franklin Institute | 2012, Issue 7 | pp. 2215-2242 | 28 pages
Authors
Mohammad Abd-Alrahman Mahmoud Abushariah; Raja Noor Ainon; Roziati Zainuddin; Assal Ali Mustafa Alqudah; Moustafa Elshafei Ahmed; Othman Omran Khalifa;
Author affiliations

Faculty of Computer Science and Information Technology, University of Malaya, 50603 Kuala Lumpur, Malaysia, and King Abdullah II School for Information Technology, University of Jordan, 11942, Amman, Jordan;

Faculty of Computer Science and Information Technology, University of Malaya, 50603 Kuala Lumpur, Malaysia;

Faculty of Computer Science and Information Technology, University of Malaya, 50603 Kuala Lumpur, Malaysia;

Faculty of Computer Science and Information Technology, University of Malaya, 50603 Kuala Lumpur, Malaysia;

Department of Systems Engineering, King Fahd University of Petroleum and Minerals, KFUPM Box 405, Dhahran 31261, Saudi Arabia;

Electrical and Computer Engineering Department, Faculty of Engineering, International Islamic University Malaysia, Gombak, 53100 Kuala Lumpur, Malaysia;

Original format: PDF
Text language: English

Similar literature


1. Arabic Speaker-Independent Continuous Automatic Speech Recognition Based on a Phonetically Rich and Balanced Speech Corpus [J]. Mohammad Abushariah, Raja Ainon, Roziati Zainuddin. The International Arab Journal of Information Technology, 2012, Issue 1.

2. Algerian Arabic Speech Database (ALGASD): Corpus Design and Automatic Speech Recognition Application [J]. Ghania Droua-Hamdani, Sid Ahmed Selouani, Malika Boudraa. The Arabian Journal for Science and Engineering, 2010, Issue 2C.

3. Algerian Modern Colloquial Arabic Speech Corpus (AMCASC): regional accents recognition within complex socio-linguistic environments [J]. Djellab Mourad, Amrouche Abderrahmane, Bouridane Ahmed. Language Resources and Evaluation, 2017, Issue 3.

4. Impact of a Newly Developed Modern Standard Arabic Speech Corpus on Implementing and Evaluating Automatic Continuous Speech Recognition Systems [C]. Mohammad A.M. Abushariah, Raja N. Ainon, Roziati Zainuddin. Spoken Dialogue Systems for Ambient Environments, 2010.

5. A multimodal fusion approach for automatic postal address recognition system using Optical Character Recognition (OCR) and Automatic Speech Recognition (ASR) techniques [D]. Singh, Amriteshwar. 2011.

6. Towards spoken clinical-question answering: evaluating and adapting automatic speech-recognition systems for spoken clinical questions [O]. Feifan Liu, Gokhan Tur, Dilek Hakkani-Tür. 2011.

7. The effects of speakers' gender, age, and region on overall performance of Arabic automatic speech recognition systems using the phonetically rich and balanced Modern Standard Arabic speech corpus [O]. Sawalha M, Abu Shariah M. 2013.

8. Implementation of a Real-Time, Interactive, Continuous Speech Recognition System [R]. Dixon, K. R. 1984.
