首页> 外文期刊>IEEE Transactions on Signal Processing >Constrained Optimization for Audio-to-Visual Conversion
【24h】

Constrained Optimization for Audio-to-Visual Conversion

机译:视听转换的约束优化

获取原文
获取原文并翻译 | 示例
       

摘要

We have developed a new audio-to-visual conversion algorithm that uses a constrained optimization approach to take advantage of dynamics of mouth movements. Based on facial muscle analysis, the dynamics of mouth movements is modeled, and constraints are obtained from it. The obtained constraints are used to estimate visual parameters from speech in a framework of hidden Markov model (HMM)-based visual parameter estimation. To solve the constrained optimization problem, the Lagrangian approach is used to transform the constrained problem into an unconstrained problem in our implementation. The proposed method is tested on various noisy environments to show its robustness and correctness. Our proposed algorithm is favorably compared with the mixture-based HMM method, which also uses audio-visual HMMs and finds optimal estimates based on a joint audio-visual probability distribution. Our proposed algorithm can estimate optimal visual parameters while satisfying the constraints and avoiding performance degradation in noisy environments.
机译:我们已经开发了一种新的视听转换算法,该算法使用约束优化方法来利用嘴巴运动的动态。基于面部肌肉分析,对嘴部运动的动力学建模,并从中获得约束。在基于隐马尔可夫模型(HMM)的视觉参数估计框架中,将获得的约束用于从语音估计视觉参数。为了解决约束优化问题,在我们的实现中使用拉格朗日方法将约束问题转化为无约束问题。所提出的方法在各种嘈杂的环境下进行了测试,以显示其鲁棒性和正确性。与基于混合的HMM方法相比,我们提出的算法具有优势,后者同时使用视听HMM并根据联合视听概率分布找到最佳估计。我们提出的算法可以在满足约束条件的同时估计最佳视觉参数,并避免在嘈杂环境中降低性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号