首页> 外文会议>IEEE International Conference on Electro Information Technology >Convolutional Neural Network Hand Gesture Recognition for American Sign Language
【24h】

Convolutional Neural Network Hand Gesture Recognition for American Sign Language

机译:卷积神经网络手势识别美国手语

获取原文

摘要

With the advancements in the computer vision technology, learning and using sign languages to communicate with deaf and mute people has become easier. Exciting research is ongoing for providing a global platform for communication in different sign languages. In this paper, we present a Deep Learning based approach to recognize a sign performed in American Sign Language by capturing an image as input. The system can predict the signs of 0 to 9 digits performed by the user. By utilizing image processing to convert RGB data to grayscale images, efficient reduction is achieved in the storage requirements and training time of the Convolutional Neural Network. The objective of the experiment is to find a mix of Image Processing and Deep Learning Architecture with lesser complexity to deploy the system in mobile applications or embedded single board computers. The database is trained from scratch using smaller networks as LeNet-5 and AlexNet as well as deeper network such as Vgg16 and MobileNet v2. The comparison of the recognition accuracies is discussed in the paper. The final selected architecture has only 10 layers including a dropout layer which boosted the training accuracy to 91.37% and testing accuracy to 87.5%.
机译:随着计算机视觉技术的进步,学习和使用与聋人和静音人员沟通的标志语言变得更加容易。令人兴奋的研究正在进行,为不同的标志语言提供全球通信平台。在本文中,我们通过将图像捕获为输入,展示了一种基于深度学习的方法来识别在美国手语中执行的标志。系统可以预测用户执行的0到9位的符号。通过利用图像处理将RGB数据转换为灰度图像,在卷积神经网络的存储需求和训练时间中实现了有效的降低。实验的目的是找到一种图像处理和深度学习架构的混合,复杂于在移动应用程序或嵌入式单板计算机中部署系统。使用较小的网络作为LENET-5和AlexNet以及更深的网络,以及vgg16和MobileNet v2,数据库从头开始培训。本文讨论了识别精度的比较。最终选定的架构只有10层,包括辍学层,它提升到91.37%的训练准确度,并测试精度为87.5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号