Journal: Automation and Remote Control

An automatic multimodal speech recognition system with audio and video information


Abstract

The mathematical model and software implementation of an automatic Russian speech recognition system that employs techniques of digital processing and analysis of audiovisual signals from a microphone and a video camera are presented. The description of probabilistic modeling of audiovisual speech based on coupled hidden Markov models, information fusion methods with weight coefficients for audio and video speech modalities, and parametric representation of signals is provided. Quantitative results in multimodal recognition of continuous Russian speech indicate high accuracy and reliability of the automatic system.
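The weighted fusion of audio and video modalities mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the paper applies weight coefficients within coupled hidden Markov models, whereas the sketch below assumes a simpler late-fusion setting in which each modality has already produced a log-likelihood per candidate hypothesis. The function names, the single `audio_weight` parameter, and the example scores are all hypothetical.

```python
def fuse_log_likelihoods(audio_ll, video_ll, audio_weight):
    """Weighted sum of per-hypothesis log-likelihoods from two modalities.

    Assumption: audio_weight lies in [0, 1] and the video stream
    receives the complementary weight 1 - audio_weight.
    """
    if not 0.0 <= audio_weight <= 1.0:
        raise ValueError("audio_weight must lie in [0, 1]")
    video_weight = 1.0 - audio_weight
    # Only hypotheses scored by both modalities can be fused.
    shared = audio_ll.keys() & video_ll.keys()
    return {
        hyp: audio_weight * audio_ll[hyp] + video_weight * video_ll[hyp]
        for hyp in shared
    }


def best_hypothesis(fused):
    """Return the hypothesis with the highest fused log-likelihood."""
    return max(fused, key=fused.get)


# Hypothetical scores for two candidate words (illustrative values only).
audio_scores = {"da": -12.0, "net": -11.5}
video_scores = {"da": -8.0, "net": -10.0}

fused = fuse_log_likelihoods(audio_scores, video_scores, audio_weight=0.7)
winner = best_hypothesis(fused)
```

With these illustrative scores, the audio stream alone slightly prefers "net", while giving the video stream a weight of 0.3 shifts the decision to "da". In practice the weight would typically be tuned to the acoustic conditions, with less weight on audio as noise increases.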
