首页> 外文期刊>LIPIcs : Leibniz International Proceedings in Informatics >The Smart Crowd - Learning from the Ones Who Know (Invited Talk)
【24h】

The Smart Crowd - Learning from the Ones Who Know (Invited Talk)

机译:聪明的人群-向认识的人学习(特邀演讲)

获取原文
       

摘要

One of the foremost challenges for information technology over the last few years has been to explore, understand, and extract useful information from large amounts of data. Some particular tasks such as annotating data or matching entities have been outsourced to human workers for many years. But the last few years have seen the rise of a new research field called crowdsourcing that aims at delegating a wide range of tasks to human workers, building formal frameworks, and improving the efficiency of these processes. In order to provide sound scientific foundations for crowdsourcing and support the development of efficient crowd sourcing processes, adequate formal models and algorithms must be defined. In particular, the models must formalize unique characteristics of crowd-based settings, such as the knowledge of the crowd and crowd-provided data; the interaction with crowd members; the inherent inaccuracies and disagreements in crowd answers; and evaluation metrics that capture the cost and effort of the crowd. Clearly, what may be achieved with the help of the crowd depends heavily on the properties and knowledge of the given crowd. In this talk we will focus on knowledgeable crowds. We will examine the use of such crowds, and in particular domain experts, for assisting solving data management problems. Specifically we will consider three dimensions of the problem: (1) How domain experts can help in improving the data itself, e.g. by gathering missing data and improving the quality of existing data, (2) How they can assist in gathering meta-data that facilitate improved data processing, and (3) How can we find and identify the most relevant crowd for a given data management task.
机译:过去几年中,信息技术面临的最重大挑战之一是从大量数据中探索,理解和提取有用的信息。一些特殊的任务,例如注释数据或匹配实体,已经外包给人类工作者多年了。但是在最近几年中,出现了一个新的研究领域,即“众包”,这种研究的目的是将各种各样的任务委托给人类工人,建立正式框架并提高这些过程的效率。为了为众包提供可靠的科学基础并支持有效的众包流程的开发,必须定义适当的正式模型和算法。特别是,模型必须形式化基于人群的环境的独特特征,例如人群知识和人群提供的数据;与人群成员的互动;人群回答中固有的错误和分歧;和评估指标,这些指标反映了人群的成本和精力。显然,在人群的帮助下可以实现的目标很大程度上取决于给定人群的属性和知识。在本次演讲中,我们将重点介绍知识渊博的人群。我们将研究使用此类人群,尤其是领域专家来协助解决数据管理问题。具体来说,我们将考虑问题的三个方面:(1)领域专家如何帮助改善数据本身,例如通过收集丢失的数据并提高现有数据的质量,(2)它们如何协助收集有助于改进数据处理的元数据,以及(3)如何为给定的数据管理任务找到和识别最相关的人群。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号