【24h】

Cascade: Crowdsourcing Taxonomy Creation

机译:级联:众包分类法创建

获取原文

摘要

Taxonomies are a useful and ubiquitous way of organizing information. However, creating organizational hierarchies is difficult because the process requires a global understanding of the objects to be categorized. Usually one is created by an individual or a small group of people working together for hours or even days. Unfortunately, this centralized approach does not work well for the large, quickly-changing datasets found on the web. Cascade is an automated workflow that creates a taxonomy from the collective efforts of crowd workers who spend as little as 20 seconds each. We evaluate Cascade and show that on three datasets its quality is 80-90% of that of experts. The cost of Cascade is competitive with expert information architects, despite taking six times more human labor. Fortunately, this labor can be parallelized such that Cascade will run in as fast as five minutes instead of hours or days.
机译:分类法是组织信息的有用且无处不在的方式。但是,创建组织层次结构很困难,因为该过程需要全局了解要分类的对象。通常一个人是由一个人或一小群人一起工作几个小时甚至几天而创建的。不幸的是,这种集中式方法不适用于网络上快速变化的大型数据集。 Cascade是一种自动化的工作流程,可通过人群工作人员的共同努力来创建分类法,每个工作人员仅花费20秒的时间。我们评估了Cascade,并显示在三个数据集上,其质量是专家质量的80-90%。尽管要花费六倍的人力,但是Cascade的成本与专家信息架构师相比具有竞争力。幸运的是,这项工作可以并行进行,以便Cascade最快可以在5分钟内运行,而不是数小时或数天。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号