Speech recognition for acoustic-assisted video coding and animation

机译：声学辅助视频编码和动画的语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we discuss issues related to analysis and synthesis of facial images using speech information. An approach to speaker independent acoustic-assisted image coding and animation is studied. A perceptually based sliding window encoder is proposed. It utilizes the high rate (or oversampled) acoustic viseme sequence from the audio domain for image domain viseme interpolation and smoothing. The image domain visemes in our approach are dynamically constructed from a set of basic visemes. The look-ahead and look-back moving interpolations in the proposed approach provide an effective way to compensate the mismatch between auditory and visual perceptions.

机译：在本文中，我们讨论了使用语音信息分析和合成面部图像的问题。研究了扬声器独立声学辅助图像编码和动画的方法。提出了一种基于感知的滑动窗口编码器。它利用来自音频域的高速（或过采样）声学发生序列，用于图像域状模具插值和平滑。我们方法中的图像域探测由一组基本鼠标动态构建。所提出的方法中的寻找和回顾移动插值提供了一种有效的方法来补偿听觉和视觉感知之间的不匹配。

著录项

来源
《Conference on Visual Communications and Image Processing》|1995年||共10页
会议地点
作者
Homer H. Chen; Wu Chou; Barry G. Haskell; Tsuhan Chen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Hybrid semantic and codebook mouth animation scheme for model-based coded video [J] . Al-Qayedi A., Clark A.F. Electronics Letters . 1999,第10期

机译：基于模型的编码视频混合语义和码本口动画方案
2. A MFCC-Based CELP Speech Coder for Server-Based Speech Recognition in Network Environments [J] . Jae Sam YOON, Gil Ho LEE, Hong Kook KIM IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences . 2007,第3期

机译：基于MFCC的CELP语音编码器，用于网络环境中基于服务器的语音识别
3. Algorithm for Coding Unit Partition in 3D Animation Using High Efficiency Video Coding Based on Canny Operator Segment [J] . ZHAO Hong, LI Jing-Bo, ZENG Xiang-Yan Journal of digital information management . 2016,第4期

机译：基于Canny算子分段的高效视频编码的3D动画单元划分编码算法。
4. Speech recognition for acoustic-assisted video coding and animation [C] . Homer H. Chen, ATT Bell Labs., Holmdel, Visual Communications and Image Processing '95 . 1995

机译：用于声学辅助视频编码和动画的语音识别
5. Objective speech intelligibility assessment using speech recognition and bigram statistics with application to low bit-rate codec evaluation [D] . Teng, Yan 2006

机译：使用语音识别和双字母组统计的客观语音清晰度评估及其在低比特率编解码器评估中的应用
6. Listeners Experience Linguistic Masking Release in Noise-Vocoded Speech-in-Speech Recognition [O] . Navin Viswanathan, Kostas Kokkinakis, Brittany T. Williams -1

机译：听众在噪声编码的语音语音识别中体验语言掩蔽的释放
7. 3D Visual Speech Animation Using 2D Videos [O] . Rabab Algadhy, Yoshihiko Gotoh, Steve Maddock 2019

机译：使用2D视频的3D视觉语音动画

Speech recognition for acoustic-assisted video coding and animation

摘要

著录项

相似文献

相关主题

期刊订阅