首页> 外文会议>IEEE Winter Conference on Applications of Computer Vision >Neural Sign Language Synthesis: Words Are Our Glosses
【24h】

Neural Sign Language Synthesis: Words Are Our Glosses

机译:神经手语综合:单词是我们的意思

获取原文

摘要

This paper deals with a text-to-video sign language synthesis. Instead of direct video production, we focused on skeletal models production. Our main goal in this paper was to design a fully end-to-end automatic sign language synthesis system trained only on available free data (daily TV broadcasting). Thus, we excluded any manual video annotation. Furthermore, our designed approach even do not rely on any video segmentation. A proposed feed-forward transformer and recurrent transformer were investigated. To improve the performance of our sequence-to-sequence transformer, soft non-monotonic attention was employed in our training process. A benefit of character-level features was compared with word-level features. We focused our experiments on a weather forecasting dataset in the Czech Sign Language.
机译:本文涉及文本到视频的手语合成。代替直接视频制作,我们专注于骨骼模型​​制作。本文的主要目的是设计一个仅在可用免费数据上进行训练的完全端到端自动手语合成系统(每日电视广播)。因此,我们排除了任何手动视频注释。此外,我们设计的方法甚至不依赖任何视频分割。研究了提出的前馈变压器和递归变压器。为了提高序列转换器的性能,我们在培训过程中采用了非单调的软性注意。将字符级功能的优势与单词级功能进行了比较。我们将实验重点放在了捷克手语的天气预报数据集上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号