
METHOD AND APPARATUS FOR VISUAL QUESTION ANSWERING, COMPUTER DEVICE AND MEDIUM


Abstract

The present disclosure provides a method for visual question answering. The method includes: acquiring an input image and an input question; constructing a visual graph based on the input image, wherein the visual graph comprises a first node feature and a first edge feature; constructing a question graph based on the input question, wherein the question graph comprises a second node feature and a second edge feature; performing a multimodal fusion on the visual graph and the question graph to obtain an updated visual graph and an updated question graph; determining a question feature based on the input question; determining a fusion feature based on the updated visual graph, the updated question graph and the question feature; and generating a predicted answer for the input image and the input question. The present disclosure further provides an apparatus for visual question answering, a computer device and a medium.
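The steps listed in the abstract can be pictured with a small sketch. The abstract does not say how the graphs are built, how the fusion is computed, or how the answer is produced, so everything below is an illustrative assumption rather than the patented method: the function names (`build_graph`, `update_graph`, `predict_answer`), the dot-product edge features, the attention-style update, the mean pooling, and the linear answer scorer are all hypothetical. The sketch also assumes that image-region features and word features share the same dimension.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def softmax(scores):
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def weighted_sum(weights, vectors):
    return [sum(w * v[k] for w, v in zip(weights, vectors))
            for k in range(len(vectors[0]))]

def mean_pool(vectors):
    return [sum(column) / len(vectors) for column in zip(*vectors)]

def build_graph(node_features):
    # Assumption: the edge feature between two nodes is their dot product.
    edges = [[dot(a, b) for b in node_features] for a in node_features]
    return {"nodes": node_features, "edges": edges}

def update_graph(graph, other):
    # Assumption: "multimodal fusion" is modelled as one edge-weighted
    # aggregation inside the graph plus one attention step over the other graph.
    updated = []
    for i, node in enumerate(graph["nodes"]):
        intra = weighted_sum(softmax(graph["edges"][i]), graph["nodes"])
        attention = softmax([dot(node, o) for o in other["nodes"]])
        cross = weighted_sum(attention, other["nodes"])
        updated.append([n + a + c for n, a, c in zip(node, intra, cross)])
    return {"nodes": updated, "edges": graph["edges"]}

def predict_answer(region_features, word_features, answer_weights):
    visual = build_graph(region_features)        # first node/edge features
    question = build_graph(word_features)        # second node/edge features
    visual_upd = update_graph(visual, question)  # updated visual graph
    question_upd = update_graph(question, visual)  # updated question graph
    question_feature = mean_pool(word_features)  # assumption: mean of word vectors
    # Assumption: the fusion feature is an element-wise product of the pooled graphs
    # and the question feature.
    fusion = [v * q * f for v, q, f in zip(mean_pool(visual_upd["nodes"]),
                                           mean_pool(question_upd["nodes"]),
                                           question_feature)]
    # Assumption: each candidate answer is scored by a linear layer.
    scores = {label: dot(fusion, w) for label, w in answer_weights.items()}
    return max(scores, key=scores.get), fusion

# Toy inputs: three image regions, two question words, two candidate answers.
regions = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
words = [[1.0, 0.2], [0.3, 0.9]]
weights = {"yes": [1.0, 0.0], "no": [0.0, 1.0]}
label, fusion_feature = predict_answer(regions, words, weights)
print(label, fusion_feature)
```

In a real system the region features would typically come from an object detector and the word features from a learned embedding, with every update step trained end to end; the toy vectors here only show how data flows between the stages named in the abstract.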
