Conference on Empirical Methods in Natural Language Processing

Widget Captioning: Generating Natural Language Description for Mobile User Interface Elements

Abstract

Natural language descriptions of user interface (UI) elements such as alternative text are crucial for accessibility and language-based interaction in general. Yet, these descriptions are constantly missing in mobile UIs. We propose widget captioning, a novel task for automatically generating language descriptions for UI elements from multimodal input including both the image and the structural representations of user interfaces. We collected a large-scale dataset for widget captioning with crowd-sourcing. Our dataset contains 162,859 language phrases created by human workers for annotating 61,285 UI elements across 21,750 unique UI screens. We thoroughly analyze the dataset, and train and evaluate a set of deep model configurations to investigate how each feature modality as well as the choice of learning strategies impact the quality of predicted captions. The task formulation and the dataset as well as our benchmark models contribute a solid basis for this novel multimodal captioning task that connects language and user interfaces.
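The dataset figures quoted in the abstract imply some simple averages. The following is a minimal sketch that derives them; only the three counts come from the abstract, while the constant and function names are hypothetical and chosen for illustration. The averages are plain ratios of the totals and say nothing about how captions are actually distributed across elements or screens.

```python
# Counts as stated in the abstract.
NUM_PHRASES = 162_859   # language phrases written by human workers
NUM_ELEMENTS = 61_285   # annotated UI elements
NUM_SCREENS = 21_750    # unique UI screens


def dataset_ratios(phrases: int, elements: int, screens: int) -> dict:
    """Return average ratios implied by the three dataset totals.

    Hypothetical helper for illustration; not part of the paper's code.
    """
    return {
        "phrases_per_element": phrases / elements,
        "elements_per_screen": elements / screens,
        "phrases_per_screen": phrases / screens,
    }


if __name__ == "__main__":
    ratios = dataset_ratios(NUM_PHRASES, NUM_ELEMENTS, NUM_SCREENS)
    for name, value in ratios.items():
        print(f"{name}: {value:.2f}")
```

On average this works out to roughly 2.7 phrases per annotated element and roughly 2.8 annotated elements per screen.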