首页> 外文学位 >Motion-capture-based hand gesture recognition for computing and control
【24h】

Motion-capture-based hand gesture recognition for computing and control

机译:基于运动捕捉的手势识别,用于计算和控制

获取原文
获取原文并翻译 | 示例

摘要

This dissertation focuses on the study and development of algorithms that enable the analysis and recognition of hand gestures in a motion capture environment. Central to this work is the study of unlabeled point sets in a more abstract sense. Evaluations of proposed methods focus on examining their generalization to users not encountered during system training.;In an initial exploratory study, we compare various classification algorithms based upon multiple interpretations and feature transformations of point sets, including those based upon aggregate features (e.g. mean) and a pseudo-rasterization of the capture space. We find aggregate feature classifiers to be balanced across multiple users but relatively limited in maximum achievable accuracy. Certain classifiers based upon the pseudo-rasterization performed best among tested classification algorithms. We follow this study with targeted examinations of certain subproblems.;For the first subproblem, we introduce the a fortiori expectation-maximization (AFEM) algorithm for computing the parameters of a distribution from which unlabeled, correlated point sets are presumed to be generated. Each unlabeled point is assumed to correspond to a target with independent probability of appearance but correlated positions. We propose replacing the expectation phase of the algorithm with a Kalman filter modified within a Bayesian framework to account for the unknown point labels which manifest as uncertain measurement matrices. We also propose a mechanism to reorder the measurements in order to improve parameter estimates. In addition, we use a state-of-the-art Markov chain Monte Carlo sampler to efficiently sample measurement matrices. In the process, we indirectly propose a constrained k-means clustering algorithm. Simulations verify the utility of AFEM against a traditional expectation-maximization algorithm in a variety of scenarios.;In the second subproblem, we consider the application of positive definite kernels and the earth mover's distance (END) to our work. Positive definite kernels are an important tool in machine learning that enable efficient solutions to otherwise difficult or intractable problems by implicitly linearizing the problem geometry. We develop a set-theoretic interpretation of ENID and propose earth mover's intersection (EMI). a positive definite analog to ENID. We offer proof of EMD's negative definiteness and provide necessary and sufficient conditions for ENID to be conditionally negative definite, including approximations that guarantee negative definiteness. In particular, we show that ENID is related to various min-like kernels. We also present a positive definite preserving transformation that can be applied to any kernel and can be used to derive positive definite EMD-based kernels, and we show that the Jaccard index is simply the result of this transformation applied to set intersection. Finally, we evaluate kernels based on EMI and the proposed transformation versus ENID in various computer vision tasks and show that END is generally inferior even with indefinite kernel techniques.;Finally, we apply deep learning to our problem. We propose neural network architectures for hand posture and gesture recognition from unlabeled marker sets in a coordinate system local to the hand. As a means of ensuring data integrity, we also propose an extended Kalman filter for tracking the rigid pattern of markers on which the local coordinate system is based. We consider fixed- and variable-size architectures including convolutional and recurrent neural networks that accept unlabeled marker input. We also consider a data-driven approach to labeling markers with a neural network and a collection of Kalman filters. Experimental evaluations with posture and gesture datasets show promising results for the proposed architectures with unlabeled markers, which outperform the alternative data-driven labeling method.
机译:本文致力于算法的研究和开发,该算法使得能够在运动捕捉环境中分析和识别手势。这项工作的核心是从更抽象的角度研究未标记的点集。对提出的方法的评估着重于检查其对系统培训期间未遇到的用户的概括。;在最初的探索性研究中,我们比较了基于点集的多种解释和特征转换的各种分类算法,包括基于集合特征(例如均值)的分类算法以及捕获空间的伪栅格化。我们发现聚合特征分类器可以在多个用户之间保持平衡,但是在最大可实现精度方面相对有限。在测试的分类算法中,基于伪栅格化的某些分类器表现最佳。我们在研究后对某些子问题进行了有针对性的检查。对于第一个子问题,我们引入了一个fortiori期望最大化(AFEM)算法,该算法用于计算分布的参数,并据此推算出未标记的相关点集。假定每个未标记的点对应于具有独立出现概率但位置相关的目标。我们建议用在贝叶斯框架内修改的卡尔曼滤波器代替算法的期望阶段,以解决未知点标签,这些标签表现为不确定的测量矩阵。我们还提出了一种对测量进行重新排序以改善参数估计的机制。此外,我们使用最先进的马尔可夫链蒙特卡洛采样器来高效地对测量矩阵进行采样。在此过程中,我们间接提出了一种约束k均值聚类算法。通过仿真验证了AFEM在各种情况下与传统的期望最大化算法的对比。在第二个子问题中,我们考虑将正定核和推土铲的距离(END)应用于我们的工作。正定核是机器学习中的重要工具,它通过隐式线性化问题的几何形状,可以有效地解决原本困难或棘手的问题。我们开发了ENID的集合论解释,并提出了土方相交点(EMI)。 ENID的正定模拟。我们提供EMD负定性的证明,并为ENID提供条件条件为负定的必要和充分条件,包括保证负定性的近似值。特别是,我们表明ENID与各种min类内核有关。我们还提出了一个正定的保留变换,该变换可以应用于任何核,并且可以用于派生基于正定的基于EMD的核,并且我们证明Jaccard索引只是此变换应用于集合交集的结果。最后,我们在各种计算机视觉任务中基于EMI和拟议的变换与ENID进行了比较,评估了内核,并表明即使使用不确定的内核技术,END通常也较差。;最后,我们将深度学习应用于我们的问题。我们提出了一种神经网络体系结构,用于从手局部坐标系统中的未标记标记集中识别手的姿势和手势。作为确保数据完整性的一种手段,我们还提出了一种扩展的卡尔曼滤波器,用于跟踪局部坐标系所基于的标记的刚性模式。我们考虑固定大小和可变大小的体系结构,包括接受未标记标记输入的卷积和递归神经网络。我们还考虑了一种数据驱动的方法来标记带有神经网络和卡尔曼过滤器的标记。使用姿势和手势数据集进行的实验评估显示,对于带有未标记标记的拟议体系结构,其结果优于可替代的数据驱动标记方法。

著录项

  • 作者

    Gardner, Andrew.;

  • 作者单位

    Louisiana Tech University.;

  • 授予单位 Louisiana Tech University.;
  • 学科 Artificial intelligence.;Computer science.;Statistics.
  • 学位 Ph.D.
  • 年度 2017
  • 页码 211 p.
  • 总页数 211
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号