首页> 外文期刊>The Visual Computer >Audio-visual speech recognition techniques in augmented reality environments
【24h】

Audio-visual speech recognition techniques in augmented reality environments

机译:增强现实环境中的视听语音识别技术

获取原文
获取原文并翻译 | 示例

摘要

Many recent studies show that Augmented Reality (AR) and Automatic Speech Recognition (ASR) technologies can be used to help people with disabilities. Many of these studies have been performed only in their specialized field. Audio-Visual Speech Recognition (AVSR) is one of the advances in ASR technology that combines audio, video, and facial expressions to capture a narrator's voice. In this paper, we combine AR and AVSR technologies to make a new system to help deaf and hard-of-hearing people. Our proposed system can take a narrator's speech instantly and convert it into a readable text and show the text directly on an AR display. Therefore, in this system, deaf people can read the narrator's speech easily. In addition, people do not need to learn sign-language to communicate with deaf people. The evaluation results show that this system has lower word error rate compared to ASR and VSR in different noisy conditions. Furthermore, the results of using AVSR techniques show that the recognition accuracy of the system has been improved in noisy places. Also, the results of a survey that was conducted with 100 deaf people show that more than 80 % of deaf people are very interested in using our system as an assistant in portable devices to communicate with people.
机译:最近的许多研究表明,增强现实(AR)和自动语音识别(ASR)技术可用于帮助残疾人。其中许多研究仅在其专业领域中进行。视听语音识别(AVSR)是ASR技术的一项进步,该技术结合了音频,视频和面部表情来捕获讲述人的声音。在本文中,我们结合了AR和AVSR技术,创建了一个新的系统来帮助聋哑人和听障人士。我们提出的系统可以立即将讲述者的语音转换为可读的文本,然后直接在AR显示器上显示该文本。因此,在该系统中,聋人可以轻松阅读叙述者的语音。此外,人们无需学习手语即可与聋人交流。评估结果表明,在不同的噪声条件下,该系统的误码率均低于ASR和VSR。此外,使用AVSR技术的结果表明,在嘈杂的地方,系统的识别精度得到了提高。另外,对100位聋人进行的调查结果显示,超过80%的聋人对使用我们的系统作为便携式设备中与人沟通的助手非常感兴趣。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号