Conference on Visual Communications and Image Processing

Speech recognition for acoustic-assisted video coding and animation


Abstract

In this paper, we discuss issues related to the analysis and synthesis of facial images using speech information. We study an approach to speaker-independent acoustic-assisted image coding and animation, and propose a perceptually based sliding-window encoder. The encoder uses the high-rate (oversampled) acoustic viseme sequence from the audio domain to interpolate and smooth visemes in the image domain. The image-domain visemes in our approach are constructed dynamically from a set of basic visemes. The look-ahead and look-back moving interpolations in the proposed approach provide an effective way to compensate for the mismatch between auditory and visual perception.
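
The sliding-window idea in the abstract can be illustrated with a minimal sketch. It is not the authors' encoder: the abstract does not specify the window shape, the perceptual weighting, or the rates, so the triangular kernel, the function names `window_weights` and `encode_frames`, and all parameters below are assumptions made for illustration. The sketch takes an oversampled sequence of acoustic viseme labels and, for each video frame, blends the basic visemes found in a look-back and look-ahead window around the frame's position.

```python
from typing import List, Sequence


def window_weights(labels: Sequence[int], center: int, look_back: int,
                   look_ahead: int, num_visemes: int) -> List[float]:
    """Blend weights over basic visemes for one video frame.

    Assumption: a triangular kernel centered on the frame; the paper's
    perceptual weighting is not given in the abstract.
    """
    lo = max(0, center - look_back)
    hi = min(len(labels), center + look_ahead + 1)
    weights = [0.0] * num_visemes
    for i in range(lo, hi):
        # Samples behind the frame use the look-back span, others the look-ahead span.
        span = look_back if i < center else look_ahead
        weights[labels[i]] += 1.0 - abs(i - center) / (span + 1)
    total = sum(weights)
    return [w / total for w in weights]


def encode_frames(labels: Sequence[int], audio_rate: float, video_rate: float,
                  look_back: int, look_ahead: int,
                  num_visemes: int) -> List[List[float]]:
    """Map a high-rate acoustic viseme sequence to per-frame blend weights."""
    if audio_rate < video_rate:
        raise ValueError("the acoustic viseme sequence must be oversampled")
    step = audio_rate / video_rate  # acoustic samples per video frame
    n_frames = int(len(labels) / step)
    return [
        window_weights(labels, int(round(k * step)), look_back, look_ahead,
                       num_visemes)
        for k in range(n_frames)
    ]


# Hypothetical input: 8 acoustic labels at 100 Hz, video at 25 Hz, 2 basic visemes.
frames = encode_frames([0, 0, 0, 0, 1, 1, 1, 1], 100, 25,
                       look_back=2, look_ahead=2, num_visemes=2)
```

In this toy input the second frame sits on the transition between the two visemes, so its weights mix both, which is the smoothing effect the look-back and look-ahead window is meant to provide.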
