首页> 外文期刊>Expert systems with applications >Hand gesture recognition via enhanced densely connected convolutional neural network
【24h】

Hand gesture recognition via enhanced densely connected convolutional neural network

机译:手势识别通过增强型密集连接的卷积神经网络

获取原文
获取原文并翻译 | 示例

摘要

Hand gesture recognition (HGR) serves as a fundamental way of communication and interaction for human being. While HGR can be applied in human computer interaction (HCI) to facilitate user interaction, it can also be utilized for bridging the language barrier. For instance, HGR can be utilized to recognize sign language, which is a visual language represented by hand gestures and used by the deaf and mute all over the world as a primary way of communication. Hand-crafted approach for vision-based HGR typically involves multiple stages of specialized processing, such as hand-crafted feature extraction methods, which are usually designed to deal with particular challenges specifically. Hence, the effectiveness of the system and its ability to deal with varied challenges across multiple datasets are heavily reliant on the methods being utilized. In contrast, deep learning approach such as convolutional neural network (CNN), adapts to varied challenges via supervised learning. However, attaining satisfactory generalization on unseen data is not only dependent on the architecture of the CNN, but also dependent on the quantity and variety of the training data. Therefore, a customized network architecture dubbed as enhanced densely connected convolutional neural network (EDenseNet) is proposed for vision-based hand gesture recognition. The modified transition layer in EDenseNet further strengthens feature propagation, by utilizing bottleneck layer to propagate the features being reused to all the feature maps in a bottleneck manner, and the following Conv layer smooths out the unwanted features. Differences between EDenseNet and DenseNet are discerned, and its performance gains are scrutinized in the ablation study. Furthermore, numerous data augmentation techniques are utilized to attenuate the effect of data scarcity, by increasing the quantity of training data, and enriching its variety to further improve generalization. Experiments have been carried out on multiple datasets, namely one NUS hand gesture dataset and two American Sign Language (ASL) datasets. The proposed EDenseNet obtains 98.50% average accuracy without augmented data, and 99.64% average accuracy with augmented data, outperforming other deep learning driven instances in both settings, with and without augmented data.
机译:手势识别(HGR)用作人类的沟通和互动的基本途径。虽然HGR可以应用于人机交互(HCI),以便于用户交互,但也可以用于弥合语言屏障。例如,HGR可以用来识别标志语言,这是一种由手势代表的视觉语言,并被聋人用来作为一个主要的通信方式。用于视觉的HGR的手工制作方法通常涉及多个专业加工阶段,例如手工制作的特征提取方法,通常旨在具体地处理特定的挑战。因此,系统的有效性及其在多个数据集中处理多种挑战的能力严重依赖于所使用的方法。相比之下,卷积神经网络(CNN)等深入学习方法,通过监督学习适应各种挑战。然而,在看不见数据上获得令人满意的概括不仅取决于CNN的架构,而且取决于培训数据的数量和各种。因此,提出了一种作为增强型密集连接的卷积神经网络(Edensenet)的定制网络架构,用于基于视觉的手势识别。通过利用瓶颈层将特征传播以瓶颈方式传播到所有特征映射的特征来传播特征传播,进一步增强特征传播,并且以下CONC层平滑了不需要的特征。探测Edensenet和DenSenet之间的差异,并且在消融研究中审查其性能增益。此外,利用许多数据增强技术来通过增加训练数据的数量来衰减数据稀缺性的效果,并富集其变化以进一步提高泛化。实验已经在多个数据集中执行,即一个Nus手势数据集和两种美国手语(ASL)数据集。建议的Edensenet在没有增强数据的情况下获得98.50%的平均精度,增强数据的平均精度为99.64%,始终表现出两种设置中的其他深度学习驱动的实例,有和没有增强数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号