Conference: International Conference on Speech and Computer

Creation and Selection of the Visual Front End Features and the Audio-Visual Feature Fusion for Audio-Visual Speech Recognition


Abstract

This contribution concerns the creation and selection of visual front-end speech features. Shape-based and appearance-based visual features are described. These features can be used for visual or audio-visual speech recognition. Before use, the features must be normalized and selected so that the recognition rate is sufficiently high. The second task is the fusion of different kinds of visual and acoustic speech features. The work concludes with experiments on the audio-visual recognition of isolated words.
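The abstract does not say how the features are normalized or fused, so the following is only a minimal sketch of one common option: per-dimension z-score normalization followed by feature-level fusion, which concatenates the audio and visual vectors frame by frame. The function names (`zscore`, `upsample`, `fuse`), the nearest-neighbour frame alignment, and the toy data are all assumptions made for illustration. They are not taken from the paper.

```python
from statistics import mean, pstdev


def zscore(frames):
    """Normalize each feature dimension to zero mean and unit variance."""
    dims = list(zip(*frames))  # transpose: one tuple per feature dimension
    mus = [mean(d) for d in dims]
    sigmas = [pstdev(d) or 1.0 for d in dims]  # avoid dividing by zero on constant dimensions
    return [[(x - m) / s for x, m, s in zip(f, mus, sigmas)] for f in frames]


def upsample(frames, target_len):
    """Repeat frames (nearest neighbour) so the stream has target_len frames."""
    n = len(frames)
    return [frames[min(n - 1, i * n // target_len)] for i in range(target_len)]


def fuse(audio, visual):
    """Feature-level fusion (assumed): concatenate normalized vectors per frame."""
    a = zscore(audio)
    # Video usually has a lower frame rate than audio, so align it to the audio length.
    v = upsample(zscore(visual), len(a))
    return [fa + fv for fa, fv in zip(a, v)]


# Toy data (illustrative only): 4 audio frames with 2 dims, 2 video frames with 1 dim.
audio = [[1.0, 10.0], [2.0, 20.0], [3.0, 30.0], [4.0, 40.0]]
visual = [[0.0], [2.0]]
fused = fuse(audio, visual)  # 4 frames, each with 2 audio dims + 1 visual dim
```

Normalizing each stream before concatenation keeps one modality from dominating the fused vector merely because its values have a larger numeric range. Decision-level fusion, which combines the outputs of separate audio and visual recognizers, is an alternative that this sketch does not cover.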
