Published in: Progress in pattern recognition, image analysis, computer vision, and applications (conference proceedings)

Robot Command Interface Using an Audio-Visual Speech Recognition System

Abstract

In recent years, audio-visual speech recognition has emerged as an active field of research thanks to advances in pattern recognition, signal processing, and machine vision. Its ultimate goal is to enable voice-based human-computer communication that also exploits the visual information contained in the audio-visual speech signal. This paper presents an automatic command recognition system that uses audio-visual information. The system is intended to control the da Vinci laparoscopic robot. The audio signal is parametrized using Mel Frequency Cepstral Coefficients (MFCC). Visual speech information is extracted using features based on the points that define the outer contour of the mouth according to the MPEG-4 standard.
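The abstract names MFCC parametrization for the audio channel but gives no settings. As an illustration only, the following is a minimal sketch of the textbook MFCC pipeline for a single frame (Hamming window, power spectrum, triangular mel filterbank, logarithm, DCT-II), written with the Python standard library. The sample rate, frame length, filter count, and coefficient count are hypothetical choices, not values taken from the paper, and the function names are invented for this example.

```python
import math

def hz_to_mel(f):
    # Common mel-scale formula (O'Shaughnessy variant).
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def power_spectrum(frame):
    """Power spectrum (bins 0..N/2) of a Hamming-windowed frame, via a naive DFT."""
    n = len(frame)
    windowed = [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    spectrum = []
    for k in range(n // 2 + 1):
        re = sum(x * math.cos(2 * math.pi * k * i / n) for i, x in enumerate(windowed))
        im = -sum(x * math.sin(2 * math.pi * k * i / n) for i, x in enumerate(windowed))
        spectrum.append((re * re + im * im) / n)
    return spectrum

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters whose centres are evenly spaced on the mel scale."""
    low, high = hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0)
    mel_points = [low + (high - low) * i / (n_filters + 1) for i in range(n_filters + 2)]
    bins = [int(math.floor((n_fft + 1) * mel_to_hz(m) / sample_rate)) for m in mel_points]
    bank = []
    for j in range(1, n_filters + 1):
        left, centre, right = bins[j - 1], bins[j], bins[j + 1]
        row = [0.0] * (n_fft // 2 + 1)
        for k in range(left, centre):      # rising edge of the triangle
            row[k] = (k - left) / (centre - left)
        for k in range(centre, right):     # falling edge of the triangle
            row[k] = (right - k) / (right - centre)
        bank.append(row)
    return bank

def mfcc(frame, sample_rate, n_filters=20, n_coeffs=12):
    """Return the first n_coeffs cepstral coefficients (c0 included) of one frame."""
    spectrum = power_spectrum(frame)
    bank = mel_filterbank(n_filters, len(frame), sample_rate)
    # Floor the filter energies so the logarithm is always defined.
    log_energies = [math.log(max(sum(w * p for w, p in zip(row, spectrum)), 1e-10))
                    for row in bank]
    # Unnormalized DCT-II of the log filterbank energies.
    return [sum(e * math.cos(math.pi * c * (j + 0.5) / n_filters)
                for j, e in enumerate(log_energies))
            for c in range(n_coeffs)]

# Hypothetical test signal: one 256-sample frame of a 440 Hz tone at 8 kHz.
sample_rate = 8000
frame = [math.sin(2 * math.pi * 440 * i / sample_rate) for i in range(256)]
coeffs = mfcc(frame, sample_rate)
print([round(c, 3) for c in coeffs])
```

In a full recognizer this computation would typically run over overlapping frames of each spoken command, with the resulting vectors combined with the visual lip-contour features before classification; how the paper performs that combination is not stated in the abstract.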
