首页> 外国专利> TEXT UNDERSTANDING BY PICTURE-IN-PICTURE. AND ROBOT THAT ATTACH SPEECH RECOGNITION.

TEXT UNDERSTANDING BY PICTURE-IN-PICTURE. AND ROBOT THAT ATTACH SPEECH RECOGNITION.

机译:通过“画中画”理解文本。和机器人进行语音识别。

摘要

The subject innovation relates to a document image recognition and speech recognition, robotics, specifically to a system that offers a variety of document information, the audio information whether or not, storage, and output the information more conveniently. In the information processing related to image processing, document processing, audio processing, and the output storage section, consisting of a main processor document image recognition and speech recognition robot.; According to the present invention in a robot provided to recognize the image document information and audio information: receiving the input document image by the image store to read the image processing, the document image creating a clear image by removing noise change as a black and white image, and the document the document processor for storage by converting the image into text, through a microphone recognizing the external sound and filter to create a clean speech comparing the voice and audio processing unit for driving out the natural language voice, outputting the converted text documents and voice, and outputs the file is the output, a storage unit for storing a DB (database), images, documents, sound processing and the output, a document recognition unit and a robot equipped with speech recognition unit is configured by including a main controller for controlling the stored image is provided.
机译:本主题创新涉及文档图像识别和语音识别,机器人技术,尤其涉及一种提供各种文档信息,是否提供音频信息,存储并更方便地输出信息的系统。在与图像处理,文档处理,音频处理和输出存储部分有关的信息处理中,由主处理器的文档图像识别和语音识别机器人组成。根据本发明,在一种机器人中,该机器人被提供来识别图像文档信息和音频信息:通过图像存储器接收输入文档图像以读取图像处理,该文档图像通过去除作为黑白的噪声变化来创建清晰图像。图像,以及通过将图像转换为文本,通过麦克风识别外部声音并进行过滤以创建纯净语音而比较的文档处理器,用于比较语音和音频处理单元以驱除自然语言语音,并输出转换后的文本文档和语音,并且输出是文件的输出,用于存储DB(数据库),图像,文档,声音处理和输出的存储单元,文档识别单元和配备语音识别单元的机器人,包括提供了用于控制所存储的图像的主控制器。

著录项

  • 公开/公告号KR200383058Y1

    专利类型

  • 公开/公告日2005-04-29

    原文格式PDF

  • 申请/专利权人

    申请/专利号KR20040028770U

  • 发明设计人 곽영배;

    申请日2004-10-07

  • 分类号B25J11/00;G06K9/00;

  • 国家 KR

  • 入库时间 2022-08-21 22:03:01

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号