首页> 外文会议>International Conference on Informatics, Health Technology >Towards intelligent arabic text-to-speech application for disabled people
【24h】

Towards intelligent arabic text-to-speech application for disabled people

机译:对残疾人的智能阿拉伯语文本到语音申请

获取原文

摘要

Assistive technology customizes speech technology to offer a new communication channel for disabled people such as blind or having speech difficulties. Converting written text into natural speech has been addressed in the last decades for some languages such as English, hence, used in many applications such as voice answering machines, reading articles and exploring software for blind people. Other languages such as Arabic are still not fully served to have high quality Text-To-Speech applications. This paper describes our effort in developing an intelligent Text-To-Speech mobile application for Arabic. We use a set of statistical language models n-gram for word prediction and auto-completion for easy typing. A large new Arabic corpus for daily communication in different domains is constructed which could be used for other purposes. A serious of normalization processing, including spelling correction, is applied to the corpus to maintain the consistency and unify the occurrence of the same words. We use outsource Sakhr Arabic Text-To-Speeh voices as one of the best speech synthesizer exist for Arabic. To ensure a high usability of the application, we use simple graphical user interface and easy access libraries to favorite phrases with an ability of adding pictures with recorded speech. Our experiments shows that word prediction using global and local corpus decries 50% of keystroke of typing desired sentences with a high prediction of 84% of bigram model.
机译:辅助技术定制语音技术,为残疾人提供新的通信渠道,如盲目或具有言论困难。在过去几十年中,在诸如英语之类的某些语言中,在许多应用程序中使用的诸如语音应答机,阅读文章和盲人软件的许多语言,已经解决了自然演讲。其他语言如阿拉伯语仍然没有完全服务于具有高质量的文本到语音应用程序。本文介绍了我们在为阿拉伯语开发智能文本到语音移动应用程序方面的努力。我们使用一组统计语言模型n-gram用于Word预测和自动完成,便于打字。构建了不同域中日常通信的大型新阿拉伯语语料库,可用于其他目的。严重的归一化处理,包括拼写校正,应用于语料库,以维持一致性并统一相同词语的发生。我们将外包Sakhr Arabic To-Speeh声音作为阿拉伯语存在的最佳语音合成器之一。为确保应用程序的高可用性,我们使用简单的图形用户界面和轻松访问库,以便在具有录制语音中添加图片的能力。我们的实验表明,使用全局和本地语料库的单词预测缩小了50%的打字所需句子的击键,其高预测为84%的Bigram模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号