首页> 外文会议>2010 IEEE Spoken Language Technology Workshop >Multimodal interactive spaces: MagicTV and magicMAP
【24h】

Multimodal interactive spaces: MagicTV and magicMAP

机译:多模式互动空间:MagicTV和magicMAP

获取原文

摘要

Through the growing popularity of voice-enabled search, multimodal applications are finally starting to get into the hands of consumers. However, these applications are principally for mobile platforms and generally involve highly-moded interaction where the user has to click or hold a button in order to speak. Significant technical challenges remain in bringing multimodal interaction to other environments such as smart living rooms and classrooms, where users speech and gesture is directed toward large displays or interactive kiosks and the microphone and other sensors are ‘always on’. In this demonstration, we present a framework combining low cost hardware and open source software that lowers the barrier of entry for exploration of multimodal interaction in smart environments. Specifically, we will demonstrate the combination of infrared tracking, face detection, and open microphone speech recognition for media search (magicTV) and map navigation (magicMap).
机译:通过启用语音的搜索的日益普及,多模式应用程序终于开始进入消费者的手中。但是,这些应用程序主要用于移动平台,通常涉及高度交互的交互,其中用户必须单击或按住按钮才能讲话。在将多模式交互引入其他环境(例如智能客厅和教室)时,仍然存在重大技术挑战,在这种环境中,用户的语音和手势指向大型显示器或交互式信息亭,并且麦克风和其他传感器始终处于“打开”状态。在本演示中,我们提出了一个结合了低成本硬件和开源软件的框架,该框架降低了在智能环境中探索多模式交互的入门门槛。具体而言,我们将演示红外跟踪,面部检测和开放式麦克风语音识别的组合,以进行媒体搜索(magicTV)和地图导航(magicMap)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号