Conference: International Moratuwa Engineering Research Conference

Sinhala Speech Recognition for Interactive Voice Response Systems Accessed Through Mobile Phones

Abstract

This paper presents the development of a Sinhala speech recognition system to be deployed in the Interactive Voice Response (IVR) system of a telecommunication service provider. The main objectives are to recognize Sinhala digits and the names of Sinhala songs to be set up as ringback tones. Because Sinhala is a phonetic language, its features are studied to derive a list of 47 phonemes. A continuous speech recognition system is developed based on Hidden Markov Models (HMMs). The acoustic model is trained on speech recorded through mobile phones. The outcome is a speaker-independent speech recognition system capable of recognizing 10 digits and 50 Sinhala songs. The system achieves a word error rate (WER) of 11.2% for digits, using a speech corpus of 0.862 hours, and a sentence error rate (SER) of 5.7% for songs, using a speech corpus of 1.388 hours.
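
The abstract reports WER and SER but does not say how they were scored. As a rough sketch, assuming the standard definitions (WER as word-level edit distance divided by the number of reference words, SER as the fraction of utterances with at least one error), the two metrics could be computed as follows. The function names and the English placeholder transcripts are hypothetical and are not taken from the paper's corpus.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def word_error_rate(refs, hyps):
    """Total word-level edit errors divided by total reference words."""
    errors = sum(edit_distance(r.split(), h.split()) for r, h in zip(refs, hyps))
    words = sum(len(r.split()) for r in refs)
    return errors / words


def sentence_error_rate(refs, hyps):
    """Fraction of utterances whose hypothesis differs from the reference."""
    wrong = sum(r.split() != h.split() for r, h in zip(refs, hyps))
    return wrong / len(refs)


# Placeholder transcripts (assumed for illustration only).
refs = ["one two three", "four five"]
hyps = ["one three three", "four five"]

print(word_error_rate(refs, hyps))      # 1 substitution over 5 reference words
print(sentence_error_rate(refs, hyps))  # 1 of 2 utterances contains an error
```

Under these definitions, WER suits the digit task, where each utterance is a string of independent words, while SER suits the song task, where a title counts as recognized only if the whole utterance is correct.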
