【24h】

Information Enquiry Kiosk with Multimodal User Interface

机译:具有多模式用户界面的信息查询亭

获取原文
获取原文并翻译 | 示例
           

摘要

A multimodal interactive dialogue automaton (kiosk) for self-service is presented in the paper.Multimodal user interface allow people to interact with the kiosk by natural speech, gestures additionally tothe standard input and output devices. Architecture of the kiosk contains key modules of speech processingand computer vision. An array of four microphones is applied for far-field capturing and recording of user'sspeech commands, it allows the kiosk to detect voice activity, to localize sources of desired speech signals, andto eliminate environmental acoustical noises. A noise robust speaker-independent recognition system isapplied to automatic interpretation and understanding of continuous Russian speech. The distant speech rec-ognizer uses grammar of voice queries as well as garbage and silence models to improve recognition accuracy.Pair of portable video-cameras are applied for vision-based detection and tracking of user's head and bodyposition inside of the working area. Russian-speaking talking head serves both for bimodal audio-visualspeech synthesis and for improvement of communication intelligibility by turning the head to an approachingclient. Dialogue manager controls the flow of dialogue and synchronizes sub-modules for input modalitiesfusion and output modalities fission. The experiments made with the multimodal kiosk were directed to cog-nitive and usability studies of human-computer interaction by different communication means.
机译:本文介绍了一种用于自助服务的多模式交互式对话自动机(kiosk)。多模式用户界面允许人们通过自然语音,手势以及标准输入和输出设备与信息亭进行交互。信息亭的体系结构包含语音处理和计算机视觉的关键模块。四个麦克风阵列用于远距离捕获和记录用户的语音命令,它使信息亭可以检测语音活动,定位所需语音信号的源并消除环境声学噪声。噪声鲁棒的独立于说话人的识别系统被应用于自动解释和理解连续的俄罗斯语音。遥距语音识别器使用语音查询的语法以及垃圾和静音模型来提高识别精度。一对便携式摄像机被用于基于视觉的检测和跟踪用户在工作区域内的头部和身体位置。讲俄语的通话头既可用于双模式视听语音合成,又可通过将其转向接近的客户来改善通信清晰度。对话管理器控制对话的流程并同步子模块,以实现输入形态融合和输出形态裂变。使用多模式信息亭进行的实验旨在通过不同的通信方式进行人机交互的认知和可用性研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号