首页> 外文期刊>International journal of speech technology >Development and analysis of Punjabi ASR system for mobile phones under different acoustic models
【24h】

Development and analysis of Punjabi ASR system for mobile phones under different acoustic models

机译:不同声学模型下手机旁遮普ASR系统的开发与分析

获取原文
获取原文并翻译 | 示例
           

摘要

Speech technology is widely gaining importance in our daily life. Speech based mobile phone applications are becoming popular in masses due to their usability and ease of access. Speech technology is helping people, with disabilities like blindness and physical abnormalities, to access and control mobile phone applications through voice, without using keypad or touchpad. Punjabi is one of the widely spoken language in various parts of the world. In this paper, an automatic speech recognition (ASR) system for mobile phone applications in Punjabi has been proposed and implemented for four different acoustic models- context independent, context dependent untied, context dependent tied, and context dependent deleted interpolation models. The proposed ASR is evaluated at 4, 16, 32 and 64 GMMs for performance analysis in terms of parameters like accuracy, word error rate and storage space required. It is observed that context dependent untied models outperform others by having better accuracy and lower word error rate, while context independent models require less storage space than others. The choice of fruitful acoustic model depends upon the available storage space as well as desired recognition accuracy. Mobile phones having limited resources may use context independent models, while context dependent untied models can be used to develop ASR system for high end mobile phones.
机译:语音技术在我们的日常生活中日益重要。基于语音的移动电话应用由于其可用性和易用性而在大众中变得越来越流行。语音技术正在帮助盲人和身体异常等残障人士通过语音访问和控制手机应用程序,而无需使用键盘或触摸板。旁遮普语是世界各地广泛使用的语言之一。在本文中,针对四种不同的声学模型提出了一种针对旁遮普手机应用程序的自动语音识别(ASR)系统,该系统用于上下文无关,上下文无关解绑,上下文相关并列和上下文相关删除内插模型。拟议的ASR在4、16、32和64 GMM处进行评估,以根据准确度,字错误率和所需存储空间等参数进行性能分析。可以看出,上下文相关的非绑定模型具有更高的准确性和更低的字错误率,而其性能优于其他模型,而上下文无关的模型比其他模型需要更少的存储空间。富有成效的声学模型的选择取决于可用的存储空间以及所需的识别精度。资源有限的移动电话可以使用上下文无关的模型,而上下文相关的非捆绑模型可以用于开发高端移动电话的ASR系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号