【24h】

Data Fusion for Geometrical and Pixel Based Lip Feature

机译:基于几何和像素的唇部特征的数据融合

获取原文

摘要

Lipreading is applied to synthesize speech for the speech-impaired people. To get a higher recognition result, data fusion with weighting coefficients at feature level is used to integrate the lip information from different kinds of lip features. Experiments are carried out based on HMM with different states and Gaussian mixture component in a small database for speaker-dependent case. Experiment results showed that the integrated discriminate vector after feature fusion obtains the information from the Geometrical feature vector of lip region and the DCT coefficients of lip’ ROI. With best weighting coefficients m: n=1.5:1, the recognition rate are improved by as much as 5.02% and 8.37%, respectively.
机译:Lipreading适用于综合演讲障碍人的言论。为了获得更高的识别结果,使用特征级别的加权系数的数据融合用于从不同种类的唇部特征集成唇部信息。实验基于HMM在小型数据库中基于具有不同状态和高斯混合组分的嗯,用于扬声器依赖案例。实验结果表明,特征融合后的集成区分矢量从唇部区域的几何特征向量和唇缘的DCT系数获得信息。最佳加权系数M:n = 1.5:1,识别率分别提高5.02%和8.37%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号