首页> 外文会议>International conference on universal access in human-computer interaction;HCI international 2009;International conference on human-computer interaction;UAHCI 2009 >Partially Observable Markov Decision Process (POMDP) Technologies for Sign Language Based Human-Computer Interaction
【24h】

Partially Observable Markov Decision Process (POMDP) Technologies for Sign Language Based Human-Computer Interaction

机译:基于手语的人机交互的部分可观察马尔可夫决策过程(POMDP)技术

获取原文

摘要

Sign language (SL) recognition modules in human-computer interaction systems need to be both fast and reliable. In cases where multiple sets of features are extracted from the SL data, the recognition system can speed up processing by taking only a subset of extracted features as its input. However, this should not be realised at the expense of a drop in recognition accuracy. By training different recognizers for different subsets of features, we can formulate the problem as the task of planning the sequence of recognizer actions to apply to SL data, while accounting for the trade-off between recognition speed and accuracy. Partially observable Markov decision processes (POMDPs) provide a principled mathematical framework for such planning problems. A POMDP explicitly models the probabilities of observing various outputs from the individual recognizers and thus maintains a probability distribution (or belief) over the set of possible SL input sentences. It then computes a policy that maps every belief to an action. This allows the system to select actions in real-time during online policy execution, adapting its behaviour according to the observations encountered. We illustrate the POMDP approach with a simple sentence recognition problem and show in experiments the advantages of this approach over "fixed action" systems that do not adapt their behaviour in real-time.
机译:人机交互系统中的手语(SL)识别模块需要既快速又可靠。在从SL数据中提取多组特征的情况下,识别系统可以通过仅将提取的特征的子集作为其输入来加快处理速度。但是,这不应以降低识别精度为代价来实现。通过针对不同的特征子集训练不同的识别器,我们可以将问题表达为计划识别器动作序列以应用于SL数据的任务,同时考虑到识别速度和准确性之间的权衡。部分可观察的马尔可夫决策过程(POMDP)为此类规划问题提供了原则上的数学框架。 POMDP显式地建模了观察来自各个识别器的各种输出的概率,从而在可能的SL输入语句的集合上维持了概率分布(或信念)。然后,它计算一个将每个信念映射到一个动作的策略。这使系统可以在联机策略执行期间实时选择操作,并根据遇到的观察来调整其行为。我们用一个简单的句子识别问题说明了POMDP方法,并在实验中显示了该方法相对于不能实时调整其行为的“固定动作”系统的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号