首页> 外文会议>International Conference on Pattern Recognition >Leveraging intra-class variations to improve large vocabulary gesture recognition
【24h】

Leveraging intra-class variations to improve large vocabulary gesture recognition

机译:利用班级内部变体来改善大词汇量手势识别

获取原文

摘要

Large vocabulary gesture recognition using a training set of limited size is a challenging problem in computer vision. With few examples per gesture class, researchers often employ exemplar-based methods such as Dynamic Time Warping (DTW). This paper makes two contributions in the area of exemplar-based gesture recognition: 1) it introduces Multiple-Pass DTW (MP-DTW), a method in which scores from multiple DTW passes focusing on different gesture properties are combined, and 2) it introduces a new set of features modeling intra-class variation of several gesture properties that can be used in conjunction with MP-DTW or DTW. We demonstrate that these techniques provide substantial improvement over DTW in both user-dependent and user-independent experiments on American Sign Language (ASL) datasets, even when using noisy data generated by RGB-D skeleton detectors. We further show that using these techniques in a large vocabulary system with a limited training set provides significantly better results compared to Long Short-Term Memory (LSTM) network and Hidden Markov Model (HMM) approaches.
机译:使用有限大小的训练集进行大词汇量手势识别是计算机视觉中的一个难题。对于每个手势类,只有很少的示例,研究人员经常采用基于示例的方法,例如动态时间规整(DTW)。本文在基于示例的手势识别领域做出了两点贡献:1)引入了多次通过DTW(MP-DTW),一种将来自多个DTW通过的,专注于不同手势属性的得分进行组合的方法,以及2)引入了一组新功能,这些功能可以对可与MP-DTW或DTW结合使用的几种手势属性的类内变化进行建模。我们证明,即使在使用RGB-D骨架检测器生成的嘈杂数据时,在美国手语(ASL)数据集上的用户依赖和用户独立实验中,这些技术也大大优于DTW。我们进一步表明,与长短期记忆(LSTM)网络和隐马尔可夫模型(HMM)方法相比,在训练集有限的大型词汇系统中使用这些技术可提供明显更好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号