International Conference on Speech and Computer

Audio-Visual Speech Recognition for Slavonic Languages (Czech and Russian)


Abstract

This paper presents the results of recent experiments in audio-visual speech recognition for two widely spoken Slavonic languages, Russian and Czech. It describes the test application tasks, the collection of the multimodal databases and the pre-processing of the data, the methods for visual feature extraction (geometric shape-based features, and DCT and PCA pixel-based visual parameterization), and the audio-visual recognition models (concatenation of feature vectors and multi-stream models). The prototype application systems that will use the audio-visual speech recognition engine are aimed mainly at the market for intelligent applications such as inquiry machines, video conference communications, and control of moving objects in noisy environments.
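The two fusion strategies named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the feature dimensions, and the convention that the two stream weights sum to one are all assumptions made for illustration, and the audio and visual streams are assumed to be already aligned to the same frame rate.

```python
import numpy as np


def concatenate_features(audio, visual):
    """Early fusion: join per-frame audio and visual vectors into one vector.

    Both inputs are (frames, dims) arrays. Assumes the streams were already
    resampled to a common frame rate (an assumption of this sketch).
    """
    if audio.shape[0] != visual.shape[0]:
        raise ValueError("audio and visual streams need the same frame count")
    return np.hstack([audio, visual])


def multistream_log_likelihood(log_lik_audio, log_lik_visual, audio_weight):
    """Multi-stream combination: weighted sum of per-stream log-likelihoods.

    audio_weight is a hypothetical stream weight in [0, 1]; the visual
    stream receives 1 - audio_weight (a common convention, assumed here).
    """
    if not 0.0 <= audio_weight <= 1.0:
        raise ValueError("audio_weight must lie in [0, 1]")
    log_lik_audio = np.asarray(log_lik_audio, dtype=float)
    log_lik_visual = np.asarray(log_lik_visual, dtype=float)
    return audio_weight * log_lik_audio + (1.0 - audio_weight) * log_lik_visual


# Illustrative data only: 5 frames, 13 audio and 10 visual coefficients.
audio = np.zeros((5, 13))
visual = np.ones((5, 10))
fused = concatenate_features(audio, visual)

# Made-up per-class log-likelihoods for two candidate classes.
scores_audio = [-10.0, -12.0]
scores_visual = [-11.0, -9.0]
clean_choice = int(np.argmax(multistream_log_likelihood(scores_audio, scores_visual, 0.7)))
noisy_choice = int(np.argmax(multistream_log_likelihood(scores_audio, scores_visual, 0.2)))
```

With these made-up scores, trusting the audio stream more (weight 0.7) selects the first class, while shifting trust to the visual stream (weight 0.2), as one might in a noisy environment, selects the second. Concatenation instead leaves the weighting of the two modalities to a single model trained on the joint vector.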
