首页> 外文学位 >Integrating gesture recognition and speech recognition in a touch-less human computer interaction system.
【24h】

Integrating gesture recognition and speech recognition in a touch-less human computer interaction system.

机译:在非接触式人机交互系统中集成手势识别和语音识别。

获取原文
获取原文并翻译 | 示例

摘要

We envision a command and control scenario where the speaker makes hand gestures while referring to objects on a computer display (monitor or projection screen). The objective is automated recognition of the gestures to manipulate the virtual objects on the display. Our approach advances the state of the art in human computer interaction technology in the following unique ways: (i) the user is expected to be at a distance from the display thus the sensing is "touchless" and is entirely based upon one or more camera unobtrusively placed in the environment, (ii) the gestures considered are predominantly two-handed, which are natural and intuitive as if speaking with the screen serving as a prop, and (iii) coherent multimodal integration of speech and gestures.;We have trained HMMs and HCRF models using features such as PCA and Optical Flow. A pose estimation algorithm has been designed to identify the object of interest. It involves locating the hand using skin region modeling for each user in real time. The speech and gesture recognition modules provide independent outputs, which are integrated to execute the user commands. Experimental results on video sequences obtained from 11 different users providing five gesture classes are discussed.
机译:我们设想一种命令和控制方案,在该方案中,说话者在指代计算机显示屏(监视器或投影屏)上的对象时做出手势。目的是自动识别手势以操纵显示器上的虚拟对象。我们的方法通过以下独特方式推动了人类计算机交互技术的发展:(i)预期用户与显示器相距一定距离,因此感应是“非接触式”的,并且完全基于一个或多个摄像头;(ii)所考虑的手势主要是双手操作,自然而直观,就像用屏幕作为道具说话一样;(iii)语音和手势的连贯多模式集成。 HMM和HCRF使用PCA和Optical Flow等功能进行建模。姿势估计算法已被设计为识别感兴趣的对象。它涉及使用每个用户的皮肤区域建模实时定位手。语音和手势识别模块提供独立的输出,这些输出经过集成以执行用户命令。讨论了从11个不同的用户获得的提供5种手势类别的视频序列的实验结果。

著录项

  • 作者

    Purkayastha, Bhaskar.;

  • 作者单位

    State University of New York at Buffalo.;

  • 授予单位 State University of New York at Buffalo.;
  • 学科 Computer Science.
  • 学位 M.S.
  • 年度 2009
  • 页码 73 p.
  • 总页数 73
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

  • 入库时间 2022-08-17 11:38:03

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号