首页> 外国专利> METHOD FOR REAL-TIME LANGUAGE RECOGNITION AND SPEECH GENERATION BASED ON THREE-DIMENSIONAL VISION USING STEREO CAMERAS, AND SYSTEM USING THE SAME

METHOD FOR REAL-TIME LANGUAGE RECOGNITION AND SPEECH GENERATION BASED ON THREE-DIMENSIONAL VISION USING STEREO CAMERAS, AND SYSTEM USING THE SAME

机译：基于立体视觉的三维视觉实时语音识别与语音生成方法及系统

页面导航

摘要
著录项
相似文献

摘要

The present invention relates to a method of speech recognition and speech generation using stereo imaging technology and a system using the method. In accordance with an aspect of the present invention, there is provided a method of speech recognition and voice generation, the method comprising: providing a stereo image providing a series of left and right images of a subject, the left image and the left image provided from the stereo image providing unit A vision processing step of generating and outputting a 3D image based on the right image, a language recognition step of configuring and outputting a language intended by the subject based on the 3D image as language text, and the And a voice generation step of receiving a language text and converting the text into a voice.

机译：本发明涉及使用立体成像技术的语音识别和语音生成的方法以及使用该方法的系统。根据本发明的一方面，提供了一种语音识别和语音生成的方法，该方法包括：提供立体图像，该立体图像提供对象的一系列左右图像，所提供的左图像和左图像来自立体图像提供单元的视觉处理步骤，基于右图像生成和输出3D图像;语言识别步骤，基于该3D图像配置和输出对象期望的语言作为语言文本，以及语音生成步骤，接收语言文本并将文本转换为语音。

著录项

公开/公告号KR20110106197A

专利类型
公开/公告日2011-09-28

原文格式PDF
申请/专利权人 KOREA INSTITUTE OF SCIENCE AND TECHNOLOGY;
展开▼

申请/专利号KR20100025470
发明设计人 YOUN IN CHAN;CHOI KUI WON;SUH JUN KYO;CHU JUN UK;KWON ICK CHAN;KIM KWANG MEYUNG;CHOI SEUNG HO;KIM SANG YOON;
展开▼

申请日2010-03-22
分类号G10L15/24;G10L15/02;
国家 KR
入库时间 2022-08-21 17:51:05

相似文献

专利
外文文献
中文文献