【24h】

Crowdsourcing Taxonomies

机译:众包分类法

获取原文
获取原文并翻译 | 示例

摘要

Taxonomies are great for organizing and searching web content. As such, many popular classes of web applications, utilize them. However, their manual generation and maintenance by experts is a time-costly procedure, resulting in static taxonomies. On the other hand, mining and statistical approaches may produce low quality taxonomies. We thus propose a drastically new approach, based on the proven, increased human involvement and desire to tag/annotate web content. We define the required input from humans in the form of explicit structural, e.g., supertype-subtype relationships between concepts. Hence we harvest, via common annotation practices, the collective wisdom of users with respect to the (categorization of) web content they share and access. We further define the principles upon which crowdsourced taxonomy construction algorithms should be based. The resulting problem is NP-Hard. We thus provide and analyze heuristic algorithms that aggregate human input and resolve conflicts. We evaluate our approach with synthetic and real-world crowdsourcing experiments and on a real-world taxonomy.
机译:分类法非常适合组织和搜索Web内容。这样,许多流行的Web应用程序类别都利用它们。但是,由专家手动生成和维护它们是耗时的过程,从而导致静态分类法。另一方面,挖掘和统计方法可能会产生低质量的分类法。因此,我们基于已证实的,越来越多的人类参与以及标记/注释Web内容的愿望,提出了一种全新的方法。我们以明确的结构形式(例如概念之间的超型-亚型关系)定义了人类所需的输入。因此,我们通过常见的注释实践,收获了用户在共享和访问的Web内容(分类)方面的集体智慧。我们进一步定义了众包分类法构建算法应基于的原则。产生的问题是NP-Hard。因此,我们提供并分析了启发式算法,这些算法可汇总人工输入并解决冲突。我们通过综合和真实的众包实验以及真实的分类法评估我们的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号