首页> 外文会议> >Viseme recognition - a comparative study
【24h】

Viseme recognition - a comparative study

机译:Viseme识别-比较研究

获取原文

摘要

Three classification algorithms for visual mouth appearances (visemes) which correspond to phonemes and their speech contexts, were compared w.rt. recognition rate, time complexity, and ROC performance. Two feature extraction procedures were verified. The first one is based on the normalized triangle MESH covering mouth area and the color image texture vector indexed by barycentric coordinates. The second procedure performs DFT on the image rectangle including mouth w.rt. small blocks of DFT coefficients. The classifiers has been designed by PCA approach and by the optimized LDA method which uses two singular subspaces approach. It appears that DFT+LDA exhibits higher recognition rate than MESH+LDA and MESH+PCA methods - 97.6% versus 94.4 and 90.2%, respectively. It is also much faster than MESH+PCA (5 ms per one video frame versus 26 ms on Pentium IV, 3.2 GHz) and slower than MESH+LDA (5 ms versus 1 ms).
机译:比较了三种对应于音素及其语音环境的视觉嘴部外观(音位)的分类算法。识别率,时间复杂度和ROC性能。验证了两种特征提取程序。第一个基于覆盖嘴区域的归一化三角形MESH和由重心坐标索引的彩色图像纹理矢量。第二个过程对包括嘴w.rt的图像矩形执行DFT。 DFT系数的小块。通过PCA方法和使用两个奇异子空间方法的优化LDA方法设计了分类器。似乎DFT + LDA的识别率比MESH + LDA和MESH + PCA方法要高-分别为97.6%和94.4%和90.2%。它也比MESH + PCA(每一个视频帧5毫秒,奔腾IV,3.2 GHz的26毫秒)要快得多,并且比MESH + LDA(5毫秒对1毫秒)要慢。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号