首页> 外文会议>IASTED International Conference on Signal and Image Processing >COMPARISON OF FIXED AND VARIABLE WEIGHT APPROCHES FOR VISEME CLASSIFICATION
【24h】

COMPARISON OF FIXED AND VARIABLE WEIGHT APPROCHES FOR VISEME CLASSIFICATION

机译:探测性分类的固定和可变权重方法的比较

获取原文

摘要

Several researchers have demonstrated that a visual speech reading system is beneficial complement to an audio speech recognition system by using of visual speech cues of the speakers face in noisy environment. However, robust and accurate visual feature extraction and classification are difficult object recognition and classification problems, due to high variation in pose, lighting and dynamic nature of the visemes. In this paper, a novel variable weights approach for classifying visemes is presented and compared with fixed weights based classification approach. Firstly, an approach using fixed significance factors (weights) for various components of visemes including mouth gestures is employed for visemes classification. The approach assumes that all visual features have same significance factor for every phoneme. The second approach is based on the hypothesis that the significance of a visual feature is variable for different phonemes. The efficiency of the variable weights approach is evaluated by comparing its results with fixed weights algorithm findings. The recognition results indicate that the variable weight approach has better performance than the fixed weight approach. The results presented demonstrate a highly accurate viseme classification approach with an average alphabet detection rate of about 36.9%. Furthermore, on average around 53% of alphabets were accurately detected using the viseme classifier described in this study.
机译:几位研究人员已经证明,通过在嘈杂环境中使用扬声器面部的视觉语音线索,视觉语音阅读系统对音频语音识别系统是有益的补充。然而,由于姿势,照明和动态性质的高度变化,稳健和准确的视觉特征提取和分类是困难的对象识别和分类问题。本文介绍了一种新的可变权重方法,并与固定权重的分类方法进行了比较。首先,采用包括嘴姿势的各种组件的固定意义因子(重量)的方法用于探测器分类。该方法假设所有视觉特征都具有每个音素具有相同的重要因素。第二种方法基于视觉特征的重要性对于不同音素的可变的假设。通过将其结果与固定权重算法结果进行比较来评估可变权重方法的效率。识别结果表明可变权重方法具有比固定权重方法更好的性能。提出的结果证明了高度准确的探测性分类方法,平均字母检测率约为36.9%。此外,平均使用本研究中描述的视野分类器精确地检测到约53%的字母表。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号