首页> 外文期刊>The imaging science journal >A novel key-frame selection-based sign language recognition framework for the video data
【24h】

A novel key-frame selection-based sign language recognition framework for the video data

机译:一种用于视频数据的新型键帧选择的标志语言识别框架

获取原文
获取原文并翻译 | 示例
       

摘要

Sign language is a medium of communication for people with hearing disabilities. Static and dynamic gestures are identified in a video-based sign language recognition and translated them into humanly understandable phrases to achieve the communication objective. However, videos contain redundant Key-frames which require additional processing. Number of such Key-frames can be reduced. The selection of particular Key-frames without losing the required information is a challenging task. The Key-frame extraction algorithm is used which helps to speed-up the sign language recognition process by extracting essential key-frames. The proposed framework eliminates the computation overhead by picking up the distinct Key-frames for the recognition process. Discrete Wavelet Transform (DWT), Discrete Cosine Transform (DCT), and Histograms of Oriented Gradient (HOG) are used for unique features extraction. We used the bagged tree, boosted tree ensemble method, Fine KNN, and SVM for classification. We tested methodology on video-based datasets of Pakistani Sign Language. It achieved an overall 97.5% accuracy on 37 Urdu alphabets and 95.6% accuracy on 100 common words.
机译:手语是听证残疾人的沟通媒介。静态和动态手势在基于视频的行语识别中识别并将其转换为人类可理解的短语以实现通信目标。但是,视频包含需要额外处理的冗余键帧。可以减少此类键帧的数量。在不丢失所需信息的情况下选择特定键帧是一个具有挑战性的任务。使用键帧提取算法,其有助于通过提取基本密钥帧来加速行程识别过程。所提出的框架通过拾取识别过程的独特键帧来消除计算开销。离散小波变换(DWT),离散余弦变换(DCT)和定向梯度(HOG)的直方图用于独特的特征提取。我们使用袋装树,提升树集合方法,细kNN和SVM进行分类。我们在巴基斯坦手语的基于视频数据集上测试了方法。它在37个Urdu字母表上实现了97.5%的准确性和100个常用词的95.6%。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号