首页> 外国专利> METHOD FOR REAL-TIME LANGUAGE RECOGNITION AND SPEECH GENERATION BASED ON THREE-DIMENSIONAL VISION USING STEREO CAMERAS, AND SYSTEM USING THE SAME

METHOD FOR REAL-TIME LANGUAGE RECOGNITION AND SPEECH GENERATION BASED ON THREE-DIMENSIONAL VISION USING STEREO CAMERAS, AND SYSTEM USING THE SAME

机译:基于立体视觉的三维视觉实时语音识别与语音生成方法及系统

摘要

The present invention relates to a method of speech recognition and speech generation using stereo imaging technology and a system using the method. In accordance with an aspect of the present invention, there is provided a method of speech recognition and voice generation, the method comprising: providing a stereo image providing a series of left and right images of a subject, the left image and the left image provided from the stereo image providing unit A vision processing step of generating and outputting a 3D image based on the right image, a language recognition step of configuring and outputting a language intended by the subject based on the 3D image as language text, and the And a voice generation step of receiving a language text and converting the text into a voice.
机译:本发明涉及使用立体成像技术的语音识别和语音生成的方法以及使用该方法的系统。根据本发明的一方面,提供了一种语音识别和语音生成的方法,该方法包括:提供立体图像,该立体图像提供对象的一系列左右图像,所提供的左图像和左图像来自立体图像提供单元的视觉处理步骤,基于右图像生成和输出3D图像;语言识别步骤,基于该3D图像配置和输出对象期望的语言作为语言文本,以及语音生成步骤,接收语言文本并将文本转换为语音。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号