首页> 外文会议>12th Asia Pacific Web Conference (APWeb 2010) >An Asymmetric Similarity Measure for Tag Clustering on Flickr
【24h】

An Asymmetric Similarity Measure for Tag Clustering on Flickr

机译:Flickr上标签聚类的非对称相似性度量

获取原文
获取原文并翻译 | 示例

摘要

Web 2.0 tools and environments have made tagging, the act of assigning keywords to on-line objects, a popular way to annotate shared resources. The success of now-prominent tagging systems makes tagging "the natural way for people to classify objects as well as an attractive way to discover new material". One of the most challenging problems is to harvest the semantics from these systems, which can support a number of applications, including tag clustering and tag recommendation. We conduct detailed studies on different types of tag relations and tag similarity measures, and propose a scalable measure that we name Reliability Factor Similarity Measure (RFSM). We compare it with two other measures having similar scalability by integrating them into hierarchical clustering methods and performing tag clustering on a subset of Flickr data. The results suggest that RFSM outperforms those two measures when it is applies for tag clustering purpose. We also present an alternative way of utilizing discovered tag relations to set up tag refining rules in order to deal with some noise in the initial tag sets, which can in turn improve the precision of tag relations.
机译:Web 2.0工具和环境已经使标记,为在线对象分配关键字的行为成为一种注释共享资源的流行方法。现在著名的标记系统的成功使标记成为“人们对物体进行分类的自然方式,以及发现新材料的一种有吸引力的方式”。最具挑战性的问题之一是从这些系统中获取语义,这可以支持许多应用程序,包括标签聚类和标签推荐。我们对不同类型的标签关系和标签相似性度量进行了详细的研究,并提出了一种可扩展的度量,我们将其称为可靠性因子相似性度量(RFSM)。通过将它们集成到分层聚类方法中,并对Flickr数据的子集执行标签聚类,我们将其与具有类似可伸缩性的其他两个措施进行了比较。结果表明,当RFSM用于标签聚类时,其性能优于这两种措施。我们还提出了一种利用发现的标签关系来设置标签优化规则的替代方法,以便处理初始标签集中的一些噪声,从而可以提高标签关系的精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号