【24h】

Workshop Program

机译:工作坊计划

获取原文
获取原文并翻译 | 示例

摘要

A successful autonomous system needs to not only understand the visual world but also communicate its understanding with humans. To make this possible, language can serve as a natural link between high level semantic concepts and low level visual perception. In this talk, I'll discuss recent work in the domain of vision and language, covering topics such as image/video captioning and retrieval, and question-answering. I'll also talk about our recent work on task execution via language instructions.
机译:一个成功的自治系统不仅需要理解视觉世界,而且还需要与人类进行交流。为了使之成为可能,语言可以充当高级语义概念和低级视觉感知之间的自然链接。在本演讲中,我将讨论视觉和语言领域的最新工作,涵盖图像/视频字幕和检索以及问题解答等主题。我还将谈论我们最近在通过语言指令执行任务方面的工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号