首页> 外文会议>IEEE Southwest Symposium on Image Analysis and Interpretation >Compression enhancement of video motion of mouth region using joint audio and video coding
【24h】

Compression enhancement of video motion of mouth region using joint audio and video coding

机译:使用关节音频和视频编码压缩口腔区域视频运动的增强

获取原文

摘要

We propose an application that utilizes audio and video data dependencies to achieve additional video compression in low-bit rate encoding systems such as: H.263+ video coding and G.723.1 audio coding standards. The joint correlation of synchronized audio and motion parameters has been proved to exist. A joint performance of Principal Component Analysis (PCA) by Karhunen-Loeve expansions (KL) and Tree-Structured Vector Quantization algorithms (TSVQ) based on LindeBuzo-Gray (LBG) and Competitive Learning (CL) techniques achieve as much as 60% bit reduction for the motion in the mouth region (1% of the overall output bit rate of a P frame) and provide the same motion-compensated image quality in high picture formats. We show performance evaluations that determine the optimal audio parameters, such as Linear Predictive Coefficients (LPC) or Line Spectrum Fairs (LSP), and determine the nature of the motion parameter in each macroblock of the mouth region when using Advanced Prediction Mode (APM) video coding.
机译:我们提出了一个应用程序,该应用程序利用音频和视频数据依赖性来实现低比特率编码系统中的额外视频压缩,例如:H.263 +视频编码和G.723.1音频编码标准。已经证明了同步音频和运动参数的关节相关性。基于Lindebuzo-Gray(LBG)和竞争学习(CL)技术的Karhunen-Loeve扩展(KL)和树结构矢量量化算法(TSVQ)的主成分分析(PCA)的联合性能和竞争学习(CL)技术实现多达60%减少口腔区域的运动(P帧的总输出比特率的1%),并以高图像格式提供相同的运动补偿图像质量。我们显示了确定最佳音频参数的性能评估,例如线性预测系数(LPC)或线频率(LSP),并且在使用高级预测模式(APM)时确定嘴区域的每个宏块中的运动参数的性质视频编码。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号