Multimodal interactive spaces: MagicTV and magicMAP

机译：多模式互动空间：MagicTV和magicMAP

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Through the growing popularity of voice-enabled search, multimodal applications are finally starting to get into the hands of consumers. However, these applications are principally for mobile platforms and generally involve highly-moded interaction where the user has to click or hold a button in order to speak. Significant technical challenges remain in bringing multimodal interaction to other environments such as smart living rooms and classrooms, where users speech and gesture is directed toward large displays or interactive kiosks and the microphone and other sensors are ‘always on’. In this demonstration, we present a framework combining low cost hardware and open source software that lowers the barrier of entry for exploration of multimodal interaction in smart environments. Specifically, we will demonstrate the combination of infrared tracking, face detection, and open microphone speech recognition for media search (magicTV) and map navigation (magicMap).

机译：通过启用语音的搜索的日益普及，多模式应用程序终于开始进入消费者的手中。但是，这些应用程序主要用于移动平台，通常涉及高度交互的交互，其中用户必须单击或按住按钮才能讲话。在将多模式交互引入其他环境（例如智能客厅和教室）时，仍然存在重大技术挑战，在这种环境中，用户的语音和手势指向大型显示器或交互式信息亭，并且麦克风和其他传感器始终处于“打开”状态。在本演示中，我们提出了一个结合了低成本硬件和开源软件的框架，该框架降低了在智能环境中探索多模式交互的入门门槛。具体而言，我们将演示红外跟踪，面部检测和开放式麦克风语音识别的组合，以进行媒体搜索（magicTV）和地图导航（magicMap）。

著录项

来源
《2010 IEEE Spoken Language Technology Workshop》|2010年|p.161-162|共2页
会议地点
作者

展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类电声技术和语音信号处理;
关键词
gesture recognition; multimodal integration; open microphone; speech recognition;

机译：手势识别;多模式集成;开放麦克风;语音识别;

相似文献

外文文献
中文文献
专利

1. Interactive Learning in Continuous Multimodal Space: A Bayesian Approach to Action-Based Soft Partitioning and Learning [J] . Firouzi H. Autonomous Mental Development, IEEE Transactions on . 2012,第2期

机译：连续多峰空间中的交互式学习：基于动作的软分区和学习的贝叶斯方法
2. MR Imaging-based Multimodal Autoidentification of Perivascular Spaces (mMAPS): Automated Morphologic Segmentation of Enlarged Perivascular Spaces at Clinical Field Strength [J] . Boespflug Erin L., Schwartz Daniel L., Lahna David, Radiology . 2018,第2期

机译：基于MR成像的脑血管空间的多模式自动识别（MMAPS）：临床强度下大血管空间的自动形态分割
3. Introduction to the Special Issue on Multimodality of Early Sensory Processing: Early Visual Maps Flexibly Encode Multimodal Space [J] . Arrighi Roberto, Binda Paola, Cicchini Guido Marco Multisensory research . 2015,第3a4期

机译：早期感官处理的多模式问题特刊简介：早期的视觉地图灵活地编码多模式空间
4. Multimodal interactive spaces: MagicTV and magicMAP [C] . {missing} IEEE Spoken Language Technology Workshop . 2010

机译：多模式互动空间：Magictv和MagicMap
5. Urban Engawa / Verandah -Fuzzy Spaces In-Between Inside and Outside making Interactive Spaces for Tokyo Urbanites- [D] . Fujii, Machiyo. 2015

机译：Urban Engawa /阳台 - 在内外的内外空间 - 东京城市互动空间 -
6. MR Imaging–based Multimodal Autoidentification of Perivascular Spaces (mMAPS): Automated Morphologic Segmentation of Enlarged Perivascular Spaces at Clinical Field Strength [O] . Erin L. Boespflug, Daniel L. Schwartz, David Lahna, -1

机译：基于MR成像的血管周围空间多模式自动识别（mMAPS）：在临床视野强度下扩大的血管周围空间的形态自动分割
7. MULTIMODAL INTERACTIVE SPACES: MAGICTV AND MAGICMAP [O] . Marcelo Worsley, Michael Johnston 2013

机译：多模态交互空间：MAGICTV和MAGICMAP

Multimodal interactive spaces: MagicTV and magicMAP

摘要

著录项

相似文献

相关主题

期刊订阅