Understanding Task Design Trade-offs in Crowdsourced Paraphrase Collection

机译：了解众包释义集合中的任务设计权衡

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Linguistically diverse datasets are critical for training and evaluating robust machine learning systems, but data collection is a costly process that often requires experts. Crowdsourcing the process of paraphrase generation is an effective means of expanding natural language datasets, but there has been limited analysis of the trade-offs that arise when designing tasks. In this paper, we present the first systematic study of the key factors in crowdsourcing paraphrase collection. We consider variations in instructions, incentives, data domains, and workflows. We manually analyzed paraphrases for correctness, gram-maticality, and linguistic diversity. Our observations provide new insight into the trade-offs between accuracy and diversity in crowd responses that arise as a result of task design, providing guidance for future paraphrase generation procedures.

机译：语言上多样化的数据集对于培训和评估强大的机器学习系统至关重要，但是数据收集是一个昂贵的过程，通常需要专家。众包释义的生成过程是扩展自然语言数据集的有效手段，但是对设计任务时所产生的权衡的分析有限。在本文中，我们提出了对众包意译收集中关键因素的第一个系统研究。我们考虑指令，激励措施，数据域和工作流程的变化。我们手动分析了复述的正确性，语法功能和语言多样性。我们的观察结果提供了新的见解，以了解由于任务设计而导致的人群响应的准确性与多样性之间的取舍，为将来的复述生成程序提供了指导。

著录项

来源
《Annual meeting of the Association for Computational Linguistics》|2017年|103-109|共7页
会议地点
作者
Youxuan Jiang; Jonathan K. Kummerfeld; Walter S. Lasecki;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
入库时间 2022-08-26 13:49:42

相似文献

外文文献
中文文献
专利

1. Expecting the Unexpected: Effects of Data Collection Design Choices on the Quality of Crowdsourced User-Generated Content [J] . Lukyanenko Roman, Parsons Jeffrey, Wiersma Yolanda F., MIS quarterly . 2019,第2期

机译：出乎意料：数据收集设计选择对众包用户生成内容质量的影响
2. Expecting the Unexpected: Effects of Data Collection Design Choices on the Quality of Crowdsourced User-Generated Content [J] . Lukyanenko Roman, Parsons Jeffrey, Wiersma Yolanda F., MIS quarterly . 2019,第2期

机译：期待意外：数据收集设计选择对众包用户生成内容的质量影响
3. Design-Based Economic Development: Understanding the Role of Cultural Institutions and Collections of Industrial and Product Design [J] . Kinahan Kelly L. Economic Development Quarterly: The Journal of American Economic Revitalization . 2016,第4期

机译：基于设计的经济发展：了解文化机构和工业品及产品设计收藏的作用
4. Understanding Task Design Trade-offs in Crowdsourced Paraphrase Collection [C] . Youxuan Jiang, Jonathan K. Kummerfeld, Walter S. Lasecki Annual meeting of the Association for Computational Linguistics . 2017

机译：了解众包释放收集中的任务设计权衡
5. Increasing shared understanding of a design task between designers and design environments: The role of a specification component [D] . Nakakoji, Kumiyo. 1993

机译：在设计人员和设计环境之间增进对设计任务的共享理解：规范组件的作用
6. Time use mobility and expenditure: an innovative survey design for understanding individuals’ trade-off processes [O] . Florian Aschauer, Inka Rösel, Reinhard Hössinger, -1

机译：时间使用流动性和支出：一种创新的调查设计用于了解个人的权衡过程
7. Understanding Task Design Trade-offs in Crowdsourced Paraphrase Collection [O] . Jiang, Youxuan, Kummerfeld, Jonathan K., Lasecki, Walter S. 2017

机译：理解众包释义中的任务设计权衡采集
8. Structured Analysis and Structured Design for the Logistic Support Analysis (LSA)Task 303 Evaluation of Alternatives and Trade-Off Analysis, LSA Subtask 303.2.2, Trade-Off between Support System Alternatives and System/Equipment Alternatives [R] . Duclos, R. 1991

机译：物流支持分析（Lsa）任务303的结构化分析和结构化设计替代方案和权衡分析的评估，Lsa子任务303.2.2，支持系统替代方案和系统/设备替代方案之间的权衡

Understanding Task Design Trade-offs in Crowdsourced Paraphrase Collection

摘要

著录项

相似文献

相关主题

期刊订阅