首页> 外文期刊>Engineering Applications of Artificial Intelligence >American Sign Language alphabet recognition using Convolutional Neural Networks with multiview augmentation and inference fusion
【24h】

American Sign Language alphabet recognition using Convolutional Neural Networks with multiview augmentation and inference fusion

机译:使用卷积神经网络结合多视图增强和推理融合的美国手语字母识别

获取原文
获取原文并翻译 | 示例
       

摘要

American Sign Language (ASL) alphabet recognition by computer vision is a challenging task due to the complexity in ASL signs, high interclass similarities, large intraclass variations, and constant occlusions. This paper describes a method for ASL alphabet recognition using Convolutional Neural Networks (CNN) with multiview augmentation and inference fusion, from depth images captured by Microsoft Kinect. Our approach augments the original data by generating more perspective views, which makes the training more effective and reduces the potential overfitting. During the inference step, our approach comprehends information from multiple views for the final prediction to address the confusing cases caused by orientational variations and partial occlusions. On two public benchmark datasets, our method outperforms the state-of-the-arts.
机译:由于ASL标志的复杂性,类别间的高度相似性,类别间的较大差异以及持续的遮挡,因此通过计算机视觉识别美国手语(ASL)字母是一项具有挑战性的任务。本文介绍了一种基于卷积神经网络(CNN)的ASL字母识别方法,该方法具有从Microsoft Kinect捕获的深度图像中进行多视图增强和推理融合的功能。我们的方法通过生成更多的透视图来增强原始数据,从而使训练更有效并减少潜在的过度拟合。在推理步骤中,我们的方法从多个视图中获取信息以进行最终预测,以解决由方向变化和部分遮挡引起的混乱情况。在两个公共基准数据集上,我们的方法优于最新技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号