首页> 外文期刊>Journal of the Franklin Institute >Modern standard Arabic speech corpus for implementing and evaluating automatic continuous speech recognition systems
【24h】

Modern standard Arabic speech corpus for implementing and evaluating automatic continuous speech recognition systems

机译:用于实现和评估自动连续语音识别系统的现代标准阿拉伯语语音语料库

获取原文
获取原文并翻译 | 示例
       

摘要

This paper presents our work towards developing a new speech corpus for Modern Standard Arabic (MSA), which can be used for implementing and evaluating Arabic speaker-independent, large vocabulary, automatic, and continuous speech recognition systems. The speech corpus was recorded by 40 (20 male and 20 female) Arabic native speakers from 11 countries representing three major regions (Levant, Gulf, and Africa). Three development phases were conducted based on the size of training data, Gaussian mixture distributions, and tied states (senones). Based on our third development phase using 11 hours of training speech data, the acoustic model is composed of 16 Gaussian mixture distributions and the state distributions tied to 300 senones. Using three different data sets, the third development phase obtained 94.32% and 8.10% average word recognition correctness rate and average Word Error Rate (WER), respectively, for same speakers with different sentences (testing sentences). For different speakers with same sentences (training sentences), this work obtained 98.10% and 2.67% average word recognition correctness rate and average WER, respectively, whereas for different speakers with different sentences (testing sentences) this work obtained 93.73% and 8.75% average word recognition correctness rate and average WER, respectively.
机译:本文介绍了我们为现代现代阿拉伯语(MSA)开发新的语料库的工作,该语料库可用于实现和评估与阿拉伯语无关的,大词汇量,自动和连续语音识别系统。来自三个主要地区(黎凡特,海湾和非洲)的11个国家的40位阿拉伯语母语人士(其中20位男性和20位女性)录制​​了语音语料库。根据训练数据的大小,高斯混合分布和束缚态(senones)进行了三个开发阶段。基于我们的第三个开发阶段,使用11个小时的训练语音数据,声学模型由16个高斯混合分布和与300个senone关联的状态分布组成。使用三个不同的数据集,第三发展阶段对于具有不同句子(测试句子)的同一说话者,分别获得94.32%和8.10%的平均单词识别正确率和平均单词错误率(WER)。对于具有相同句子(训练句子)的不同说话者,这项工作分别获得了平均单词识别正确率和平均WER的98.10%和2.67%,而对于具有不同句子(测试句子)的不同说话者,该作品获得了93.73%和8.75%的平均分数。单词识别正确率和平均WER。

著录项

  • 来源
    《Journal of the Franklin Institute》 |2012年第7期|p.2215-2242|共28页
  • 作者单位

    Faculty of Computer Science and Information Technology, University of Malaya, 50603 Kuala Lumpur, Malaysia,King Abdullah II School for Information Technology, University of Jordan, 11942, Amman, Jordan;

    Faculty of Computer Science and Information Technology, University of Malaya, 50603 Kuala Lumpur, Malaysia;

    Faculty of Computer Science and Information Technology, University of Malaya, 50603 Kuala Lumpur, Malaysia;

    Faculty of Computer Science and Information Technology, University of Malaya, 50603 Kuala Lumpur, Malaysia;

    Department of Systems Engineering, King Fahd University of Petroleum and Minerals, KFUPM Box 405, Dhahran 31261, Saudi Arabia;

    Electrical and Computer Engineering Department, Faculty of Engineering, International Islamic University Malaysia, Gombak, 53100 Kuala Lumpur, Malaysia;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-18 02:57:59

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号