
Neural Baby Talk

Abstract

We introduce a novel framework for image captioning that can produce natural language explicitly grounded in entities that object detectors find in the image. Our approach reconciles classical slot filling approaches (that are generally better grounded in images) with modern neural captioning approaches (that are generally more natural sounding and accurate). Our approach first generates a sentence 'template' with slot locations explicitly tied to specific image regions. These slots are then filled in by visual concepts identified in the regions by object detectors. The entire architecture (sentence template generation and slot filling with object detectors) is end-to-end differentiable. We verify the effectiveness of our proposed model on different image captioning tasks. On standard image captioning and novel object captioning, our model reaches state-of-the-art on both COCO and Flickr30k datasets. We also demonstrate that our model has unique advantages when the train and test distributions of scene compositions - and hence language priors of associated captions - are different. Code has been made available at: https://github.com/jiasenlu/NeuralBabyTalk.
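To make the two-stage idea concrete, below is a minimal, hypothetical PyTorch sketch, not the authors' released code at the GitHub link above, of a decoder that at each step chooses either a textual word or a "visual" slot pointing at a detected region; chosen slots are then filled with the detector's category labels. All class names, dimensions, and the fill_slots helper are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class SlotFillingDecoder(nn.Module):
    """Toy decoder: each step scores ordinary vocabulary words and detected
    regions in one joint softmax; region choices become slots in the template."""

    def __init__(self, vocab_size, embed_dim=256, hidden_dim=512, region_dim=2048):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.rnn = nn.LSTMCell(embed_dim, hidden_dim)
        self.word_head = nn.Linear(hidden_dim, vocab_size)    # scores for textual words
        self.region_proj = nn.Linear(region_dim, hidden_dim)  # keys for pointer scores

    def step(self, prev_word, state, region_feats):
        # prev_word: (B,) word ids; region_feats: (B, R, region_dim) from a detector
        h, c = self.rnn(self.embed(prev_word), state)
        word_logits = self.word_head(h)                              # (B, V)
        keys = self.region_proj(region_feats)                        # (B, R, H)
        region_logits = torch.bmm(keys, h.unsqueeze(2)).squeeze(2)   # (B, R)
        # Joint distribution over "emit a word" vs. "point at region r".
        log_probs = F.log_softmax(torch.cat([word_logits, region_logits], dim=1), dim=1)
        return log_probs, (h, c)


def fill_slots(token_ids, vocab, region_classes, vocab_size):
    """Turn decoded ids into a caption: ids >= vocab_size are visual slots,
    filled with the detector's class name for the pointed-to region."""
    out = []
    for t in token_ids:
        out.append(vocab[t] if t < vocab_size else region_classes[t - vocab_size])
    return " ".join(out)


# Tiny usage example with random features standing in for detector outputs.
if __name__ == "__main__":
    vocab = ["<s>", "a", "sitting", "on", "the", "with", "next", "to"]
    dec = SlotFillingDecoder(vocab_size=len(vocab))
    feats = torch.randn(1, 3, 2048)                  # 3 detected regions
    log_probs, state = dec.step(torch.tensor([0]), None, feats)
    print(log_probs.shape)                           # torch.Size([1, 11]): 8 words + 3 regions
    print(fill_slots([1, 8, 2, 3, 4, 10], vocab, ["dog", "cake", "table"], len(vocab)))
    # -> "a dog sitting on the table"
```

Because the word scores and the region pointer scores share a single softmax, both the template generation and the slot choices receive gradients, which is the property the abstract refers to as end-to-end differentiability; this sketch only illustrates that joint word-or-region choice, not the paper's full attention and refinement machinery.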
