首页> 外文会议>International Conference on speech and computer >Speech Recognition Challenges in the Car Navigation Industry
【24h】

Speech Recognition Challenges in the Car Navigation Industry

机译:汽车导航行业中的语音识别挑战

获取原文

摘要

Until a few decades ago, machines talking and understanding human speech were only the subject of science fiction. Nowadays, Text to Speech (TTS) and Automatic Speech Recognition (ASR) became reality, but they are still being considered to be fancy. Automotive infotainment is a selling point for car manufacturers, it is a symbol of being hi-tech, and car commercials often feature the display of the head unit for a few seconds. As avoiding Driver Distraction has grown a major design aspect, ASR is becoming trendy and almost compulsory. But let us see how far we have gotten. In the first part, this talk will summarize the most popular Speech features in today's car navigation systems, and will look into the underlying technology, solutions and limitations widely applied in the industry. We will mention typical context designs, dialogue systems and address search, and we will show how the common technology leads to typical HMI solutions. We will point out the possibilities and limitations of on-board and server-based recognition, and consider why we need to resort to exclusively offline solutions for a while in this industry. At this point we will have an overview of the ingredients, so the talk will focus on problematic and sub-optimal ASR features requested by automotive manufacturers, explaining why they negatively affect recognition accuracy. A workaround often leads to troublesome and seemingly unnecessary questions for the user, so it is not easy to compromise. In the last part, we will examine a certain address search scenario which is trivial for users, and is feasible with a server-based ASR, however being an open question when done offline.
机译:直到几十年前,说话和理解人类语音的机器只是科幻小说的主题。如今,文字转语音(TTS)和自动语音识别(ASR)成为现实,但它们仍被认为是奇特的。汽车信息娱乐系统是汽车制造商的卖点,它是高科技的象征,汽车广告通常会在主机头上显示几秒钟。由于避免驾驶员分心已成为设计的主要方面,因此ASR变得越来越流行,并且几乎是强制性的。但是,让我们看看我们已经走了多远。在第一部分中,本演讲将总结当今汽车导航系统中最流行的语音功能,并将探讨在行业中广泛应用的基础技术,解决方案和局限性。我们将提到典型的上下文设计,对话系统和地址搜索,并且将展示通用技术如何导致典型的HMI解决方案。我们将指出基于板载和基于服务器的识别的可能性和局限性,并考虑为什么在这个行业中我们为什么需要一段时间使用专有的脱机解决方案。在这一点上,我们将对成分进行概述,因此演讲将集中于汽车制造商要求的有问题的和次优的ASR功能,并解释为什么它们会对识别精度产生负面影响。解决方法通常会给用户带来麻烦和看似不必要的问题,因此不容易妥协。在最后一部分中,我们将研究一种对于用户而言微不足道的地址搜索方案,并且该方案对于基于服务器的ASR是可行的,但是在脱机时是一个悬而未决的问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号