European Conference on Computer Vision

Score-Level Multi Cue Fusion for Sign Language Recognition

Abstract

Sign languages are expressed through hand and upper-body gestures as well as facial expressions. Therefore, Sign Language Recognition (SLR) needs to focus on all such cues. Previous work uses hand-crafted mechanisms or network aggregation to extract the different cue features and increase SLR performance, which is slow and involves complicated architectures. We propose a more straightforward approach that trains separate cue models specializing in the dominant hand, hands, face, and upper-body regions. We compare the performance of 3D Convolutional Neural Network (CNN) models specializing in these regions, combine them through score-level fusion, and also evaluate a weighted fusion variant. Our experimental results show the effectiveness of mixed convolutional models. Their fusion yields up to a 19% accuracy improvement over the baseline that uses the full upper body. Furthermore, we include a discussion of fusion settings, which can help future work on Sign Language Translation (SLT).
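The score-level fusion described in the abstract can be illustrated with a short sketch. The cue names, fusion weights, and score shapes below are illustrative assumptions rather than values from the paper: each cue-specific 3D CNN is assumed to output a vector of class logits for a sign clip, and the fused prediction is the arg-max of the (optionally weighted) average of the per-cue softmax probabilities.

```python
import numpy as np

def softmax(logits):
    """Convert raw class scores (logits) into probabilities."""
    z = logits - logits.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def fuse_scores(cue_logits, weights=None):
    """Score-level fusion of per-cue class scores.

    cue_logits: dict mapping cue name -> (num_classes,) array of logits
                produced by that cue's model.
    weights:    optional dict of per-cue fusion weights; uniform if None.
    Returns the predicted class index and the fused probability vector.
    """
    cues = sorted(cue_logits)
    if weights is None:
        weights = {c: 1.0 for c in cues}            # plain (unweighted) fusion
    w = np.array([weights[c] for c in cues], dtype=float)
    w /= w.sum()                                    # normalise fusion weights
    probs = np.stack([softmax(cue_logits[c]) for c in cues])  # (num_cues, num_classes)
    fused = (w[:, None] * probs).sum(axis=0)        # weighted average of probabilities
    return int(fused.argmax()), fused

# Hypothetical example with four cue models and five sign classes.
rng = np.random.default_rng(0)
logits = {c: rng.normal(size=5)
          for c in ["dominant_hand", "hands", "face", "upper_body"]}
pred_uniform, _ = fuse_scores(logits)
pred_weighted, _ = fuse_scores(logits, weights={"dominant_hand": 2.0, "hands": 1.0,
                                                "face": 0.5, "upper_body": 1.0})
print(pred_uniform, pred_weighted)
```

The weighted variant simply rescales each cue's contribution before averaging; with uniform weights it reduces to plain score averaging.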
