首页> 外文会议>Gesture-Based Communication in Human-Computer Interaction >Toward Scalability in ASL Recognition: Breaking Down Signs into Phonemes
【24h】

Toward Scalability in ASL Recognition: Breaking Down Signs into Phonemes

机译:迈向ASL识别的可扩展性:将符号分解为音素

获取原文

摘要

In this paper we present a novel approach to continuous, whole-sentence ASL recognition that uses phonemes instead of whole signs as the basic units. Our approach is based on a sequential phonological model of ASL. According to this model the ASL signs can be broken into movements and holds, which are both considered phonemes. This model does away with the distinction between whole signs and epenthesis movements that we made in previous work. Instead, epenthesis movements are just like the other movements that constitute the signs. We subsequently train Hidden Markov Models (HMMs) to recognize the phonemes, instead of whole signs and epenthesis movements that we recognized previously. Because the number of phonemes is limited, HMM-based training and recognition of the ASL signal becomes computationally more tractable and has the potential to lead to the recognition of large-scale vocabularies. We experimented with a 22 word vocabulary, and we achieved similar recognition rates with phoneme-and word-based approaches. This result is very promising for scaling the task in the future.
机译:在本文中,我们提出了一种新颖的连续,整句ASL识别方法,该方法使用音素而不是整个符号作为基本单位。我们的方法基于ASL的顺序语音模型。根据此模型,ASL标志可以分解为动作和保持,它们都被视为音素。该模型消除了我们在以前的工作中所做的整个体征和上肢运动之间的区别。取而代之的是,上肢运动就像其他构成标志的运动一样。随后,我们训练了隐马尔可夫模型(HMM)来识别音素,而不是我们之前识别的整个符号和上肢运动。由于音素的数量有限,因此基于HMM的ASL信号训练和识别在计算上变得更容易处理,并且有可能导致识别大量词汇。我们使用22个单词的词汇进行了实验,并通过基于音素和单词的方法获得了相似的识别率。这个结果对于将来扩展任务很有希望。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号