Widget Captioning: Generating Natural Language Description for Mobile User Interface Elements

Abstract

Natural language descriptions of user interface (UI) elements such as alternative text are crucial for accessibility and language-based interaction in general. Yet, these descriptions are constantly missing in mobile UIs. We propose widget captioning, a novel task for automatically generating language descriptions for UI elements from multimodal input including both the image and the structural representations of user interfaces. We collected a large-scale dataset for widget captioning with crowd-sourcing. Our dataset contains 162,859 language phrases created by human workers for annotating 61,285 UI elements across 21,750 unique UI screens. We thoroughly analyze the dataset, and train and evaluate a set of deep model configurations to investigate how each feature modality as well as the choice of learning strategies impact the quality of predicted captions. The task formulation and the dataset as well as our benchmark models contribute a solid basis for this novel multimodal captioning task that connects language and user interfaces.

Bibliographic details

Source: Conference on Empirical Methods in Natural Language Processing, 2020, pp. 5495-5510 (16 pages)
Authors: Yang Li; Gang Li; Luheng He; Jingjie Zheng; Hong Li; Zhiwei Guan
Original format: PDF
