International Conference on Automatic Face and Gesture Recognition

Pseudo-Convolutional Policy Gradient for Sequence-to-Sequence Lip-Reading


Abstract

Lip-reading aims to infer the speech content from the lip movement sequence and can be seen as a typical sequence-to-sequence (seq2seq) problem, translating an input image sequence of lip movements into the text sequence of the speech content. However, the traditional learning process of seq2seq models suffers from two problems: the exposure bias resulting from the "teacher-forcing" strategy, and the inconsistency between the discriminative optimization target (usually the cross-entropy loss) and the final evaluation metric (usually the character/word error rate). In this paper, we propose a novel pseudo-convolutional policy gradient (PCPG) based method to address these two problems. On the one hand, we introduce the evaluation metric (the character error rate in this paper) as a form of reward to optimize the model together with the original discriminative target. On the other hand, inspired by the local perception property of the convolution operation, we perform a pseudo-convolutional operation along the reward and loss dimension, so that more context around each time step is taken into account when generating a robust reward and loss for the whole optimization. Finally, we perform a thorough comparison and evaluation on both word-level and sentence-level benchmarks. The results show a significant improvement over related methods, achieving either new state-of-the-art performance or competitive accuracy on all of these challenging benchmarks, which clearly demonstrates the advantages of our approach.
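
The following is a minimal, illustrative sketch (not the authors' implementation) of how the two ideas in the abstract could be combined in PyTorch: (1) a REINFORCE-style policy-gradient term whose reward is derived from the character error rate (CER), mixed with the usual cross-entropy loss, and (2) a "pseudo-convolutional" sliding-window average applied along the time dimension of the per-step rewards, so that each step's gradient also reflects its local context. The window size, the mixing weight lambda_rl, and the per-step reward layout are assumptions made for illustration only.

    # Sketch of a PCPG-style loss under the assumptions stated above.
    import torch
    import torch.nn.functional as F

    def pcpg_loss(step_log_probs, step_rewards, ce_loss, window=3, lambda_rl=0.5):
        # step_log_probs: (batch, T) log-probabilities of the tokens sampled by the decoder
        # step_rewards:   (batch, T) per-step rewards, e.g. derived from 1 - CER of the hypothesis
        # ce_loss:        scalar cross-entropy loss from ordinary teacher-forced training
        rewards = step_rewards.unsqueeze(1)                               # (batch, 1, T)
        kernel = torch.ones(1, 1, window, device=rewards.device) / window
        # "Pseudo-convolution": average each reward with its temporal neighbours, so a
        # single noisy step does not dominate the policy-gradient update (local perception).
        smoothed = F.conv1d(rewards, kernel, padding=window // 2).squeeze(1)
        smoothed = smoothed[:, : step_log_probs.size(1)]                  # trim padding overhang
        # REINFORCE-style objective: maximise the expected (smoothed) reward.
        rl_loss = -(smoothed.detach() * step_log_probs).mean()
        # Optimise the evaluation-metric-based reward together with the original target.
        return (1.0 - lambda_rl) * ce_loss + lambda_rl * rl_loss

In this sketch the reward could, for instance, assign every time step the sequence-level value 1 - CER of the sampled hypothesis against the reference; the sliding-window average then turns it into a locally smoothed per-step signal before the policy-gradient term is formed.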