首页> 外文会议>International world wide web conference >Stop Thinking, Start Tagging: Tag Semantics Emerge from Collaborative Verbosity
【24h】

Stop Thinking, Start Tagging: Tag Semantics Emerge from Collaborative Verbosity

机译:停止思考,开始标记:标记语义从协作冗长中出现

获取原文

摘要

Recent research provides evidence for the presence of emergent semantics in collaborative tagging systems. While several methods have been proposed, little is known about the factors that influence the evolution of semantic structures in these systems. A natural hypothesis is that the quality of the emergent semantics depends on the pragmatics of tagging: Users with certain usage patterns might contribute more to the resulting semantics than others. In this work, we propose several measures which enable a pragmatic differentiation of taggers by their degree of contribution to emerging semantic structures. We distinguish between categorizers, who typically use a small set of tags as a replacement for hierarchical classification schemes, and describers, who are annotating resources with a wealth of freely associated, descriptive keywords. To study our hypothesis, we apply semantic similarity measures to 64 different partitions of a real-world and large-scale folksonomy containing different ratios of categorizers and describers. Our results not only show that 'verbose' taggers are most useful for the emergence of tag semantics, but also that a subset containing only 40 % of the most 'verbose' taggers can produce results that match and even outperform the semantic precision obtained from the whole dataset. Moreover, the results suggest that there exists a causal link between the pragmatics of tagging and resulting emergent semantics. This work is relevant for designers and analysts of tagging systems interested (I) in fostering the semantic development of their platforms, (ii) in identifying users introducing "semantic noise", and (iii) in learning ontologies.
机译:最近的研究提供了在协作标记系统中存在的紧急语义存在证据。虽然已经提出了几种方法,但关于影响这些系统中语义结构演变的因素很少。自然假设是,紧急语义的质量取决于标记的语用:具有某些使用模式的用户可能会导致由此产生的语义比其他语义更多。在这项工作中,我们提出了几种措施,使其对新兴语义结构的贡献程度具有务实的分化。我们区分分类程序,他通常使用一小组标签作为分层分类方案的替代品,并描述为具有丰富自由相关的描述性关键字的资源注释资源。为学习我们的假设,我们将语义相似度措施应用于含有不同分类的不同比例和描述者的现实世界和大规模愚蠢的分区。我们的结果不仅表明“详细”标记对于标签语义的出现最有用,而且还有一个只有40%的最大“冗长”标记器的子集可以产生匹配甚至优于从中获得的语​​义精度的结果整个数据集。此外,结果表明,标记的语用与产生的紧急语义之间存在因果关系。这项工作与标记系统的设计师和分析师有关(i)促进其平台的语义发展,(ii)在识别学习本体中引入“语义噪声”的用户和(iii)的识别。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号