Method, apparatus, computer equipment, computer-readable storage medium and computer program for visual question answering

Abstract

The present invention provides a visual question answering method comprising: obtaining an input image and an input question; detecting visual information and position information of each of at least one text region in the input image; determining semantic information and attribute information of each of the at least one text region based on the visual information and position information; determining a global feature of the input image based on the visual information, position information, semantic information and attribute information; determining a question feature from the input question; and generating a predicted answer for the input image and the input question based on the global feature and the question feature. The present invention also provides a visual question answering apparatus, computer equipment, a computer-readable storage medium, and a computer program.
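The data flow described in the abstract can be sketched roughly as follows. This is only an illustrative toy, not the patented implementation: the abstract does not specify any model, so every name here (TextRegion, detect_text_regions, add_semantic_and_attribute, global_feature, question_feature, predict_answer, answer_question) is hypothetical, the "image" is assumed to be a pre-extracted list of (visual vector, bounding box) pairs, and the semantic, attribute, question and answer-scoring steps are simple arithmetic stand-ins for whatever learned components a real system would use.

```python
from dataclasses import dataclass, field
from typing import List, Sequence, Tuple

Vector = List[float]


@dataclass
class TextRegion:
    visual: Vector                      # visual information of the text region
    position: Vector                    # position information: box [x1, y1, x2, y2]
    semantic: Vector = field(default_factory=list)
    attribute: Vector = field(default_factory=list)


def detect_text_regions(image: Sequence[Tuple[Vector, Vector]]) -> List[TextRegion]:
    # Placeholder detector: the toy "image" already lists (visual, box) pairs.
    return [TextRegion(list(v), list(b)) for v, b in image]


def add_semantic_and_attribute(region: TextRegion) -> TextRegion:
    # Stand-ins derived from visual and position information only.
    region.semantic = [sum(region.visual)]
    x1, y1, x2, y2 = region.position
    region.attribute = [(x2 - x1) * (y2 - y1)]  # box area as a toy attribute
    return region


def global_feature(regions: List[TextRegion]) -> Vector:
    # Concatenate the four kinds of information per region, then mean-pool.
    rows = [r.visual + r.position + r.semantic + r.attribute for r in regions]
    return [sum(col) / len(rows) for col in zip(*rows)]


def question_feature(text: str, dim: int) -> Vector:
    # Toy bag-of-words: each word increments a bucket chosen by its character codes.
    vec = [0.0] * dim
    for word in text.lower().split():
        vec[sum(ord(c) for c in word) % dim] += 1.0
    return vec


def predict_answer(g: Vector, q: Vector, candidates: Sequence[str]) -> str:
    # Fuse the two features element-wise and score each candidate answer.
    fused = [a * b for a, b in zip(g, q)]

    def score(candidate: str) -> float:
        c = question_feature(candidate, len(fused))
        return sum(a * b for a, b in zip(fused, c))

    return max(candidates, key=score)


def answer_question(image, question: str, candidates: Sequence[str]) -> str:
    regions = [add_semantic_and_attribute(r) for r in detect_text_regions(image)]
    g = global_feature(regions)
    q = question_feature(question, len(g))
    return predict_answer(g, q, candidates)


toy_image = [([0.2, 0.8], [0.1, 0.1, 0.5, 0.3]),
             ([0.6, 0.4], [0.2, 0.6, 0.9, 0.8])]
toy_candidates = ["exit", "stop", "open"]
print(answer_question(toy_image, "what does the sign say", toy_candidates))
```

The sketch keeps the order of steps stated in the abstract: semantic and attribute information are computed only from each region's visual and position information, and the answer is chosen only from the global feature and the question feature.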
