首页> 外文会议>International conference on advances in computing, communications and informatics >Sign language recognition through kinect based depth images and neural network
【24h】

Sign language recognition through kinect based depth images and neural network

机译:通过基于kinect的深度图像和神经网络进行手语识别

获取原文

摘要

Sign language is the language of the people with hearing and speaking disabilities. In it mostly hands are moved in a particular way which along with some facial expression produces a meaningful thought which the speaker would like to convey to others. Using the sign language people with speaking and hearing disabilities can communicate with others who know the language very easily but it becomes difficult when it comes to interacting with a normal person. As a result there is a requirement of an intermediate system which will help in improving the interaction between people with the hearing disabilities as well as with the normal people. In this paper we present a sign language recognition technique using kinect depth camera and neural network. Using the kinect camera we obtain the image of the person standing in front of the camera and then we crop the hand region from the depth image and pre-process that image using the morphological operations to remove unwanted region from the hand image and find the contour of the hand sign and from the particular contour position of the hand we generate a signal on which Discrete Cosine Transform (DCT) is applied and first 200 DCT coefficient of the signal are feed to the neural network for training and classification and finally the network classify and recognize the sign. A data set of sign from 0 to 9 are formed using kinect camera and we tested on 1236 images in the database on which training is applied and we achieved 98% training and an average accuracy for all the sign recognition as 83.5%.
机译:手语是听力和语言障碍人士的语言。在这种情况下,大多数手都以特定的方式移动,这与某些面部表情一起产生了有意义的思想,演讲者想传达给其他人。使用手语的残障人士可以很容易地与其他会语言的人进行交流,但是与正常人互动时会变得很困难。因此,需要一种中间系统,该系统将有助于改善听力障碍人士与普通人之间的互动。在本文中,我们提出了一种使用kinect深度摄像头和神经网络的手语识别技术。使用kinect相机,我们获得了站在相机前的人的图像,然后从深度图像中裁剪出手部区域,并使用形态学运算对该图像进行预处理,以从手部图像中删除不需要的区域并找到轮廓根据手的特定轮廓位置生成一个信号,在信号上应用离散余弦变换(DCT),并将该信号的前200个DCT系数馈入神经网络以进行训练和分类,最后对网络进行分类并识别标志。使用kinect相机形成了从0到9的符号数据集,我们在数据库中对1236张图像进行了测试,并对其进行了训练,我们获得了98%的训练,所有符号识别的平均准确度为83.5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号