...
首页> 外文期刊>Concurrency and computation: practice and experience >One-class support vector machines for personalized tag-based resource classification in social bookmarking systems
【24h】

One-class support vector machines for personalized tag-based resource classification in social bookmarking systems

机译:一类支持向量机,用于社会书签系统中基于个性化标签的资源分类

获取原文
获取原文并翻译 | 示例
           

摘要

Social tagging systems allow users to easily create, organize, and share collections of Web resources in a collaborative fashion. Videos, pictures, research papers, and Web pages are shared and annotated in sites such as Del.icio.us, CiteULike, or Flickr, among others. The rising popularity of these systems leads to a constant increase in the number of users actively publishing and annotating resources and, consequently, an exponential growth in the amount of data contained in their folksonomies, the underlying data structure of tagging systems. In turn, the user task of discovering interesting resources becomes more and more difficult and time-consuming. In this paper, the problem of filtering resources from social tagging systems according to individual user interests using purely tagging data is studied. One-class support vector machine classification is evaluated as a means to identify relevant information for users based exclusively on positive examples of their information preferences. It is assumed that users express their interest on resources belonging to a folksonomy by assigning tags to them, whereas there is no straightforward method to collect uninterestingness judgments. Filtering interesting resources based on social tags is an important benefit of exploiting the collective knowledge generated by tagging activities of Web communities. In this paper, the results achieved with tag-based classification are compared with those obtained using more traditional information sources such as the full text of Web pages. Experimental evaluation showed that tag-based classifiers outperformed those learned using the text of documents as well as other content-related sources. Moreover, tag-based classification becomes essential for folksonomies in which no additional content is available because of the nature of resources being stored (e.g., tagging of photos or videos). Copyright © 2012 John Wiley & Sons, Ltd.
机译:社交标记系统允许用户以协作方式轻松创建,组织和共享Web资源集合。视频,图片,研究论文和网页在Del.icio.us,CiteULike或Flickr等网站中共享和注释。这些系统的日益普及导致积极发布和注释资源的用户数量不断增加,因此,其民俗分类法(即标签系统的基础数据结构)中包含的数据量呈指数增长。反过来,发现有趣资源的用户任务变得越来越困难和耗时。本文研究了使用纯标签数据根据个人用户兴趣从社交标签系统过滤资源的问题。一类支持向量机分类被评估为一种仅基于用户信息偏好的肯定示例为用户标识相关信息的手段。假定用户通过给他们分配标签来表示对属于民俗疗法的资源的兴趣,而没有直接的方法来收集不感兴趣的判断。基于社交标签过滤有趣的资源是利用通过标记Web社区的活动生成的集体知识的重要好处。在本文中,将基于标签的分类所获得的结果与使用更传统的信息源(如网页的全文)所获得的结果进行了比较。实验评估表明,基于标签的分类器优于使用文档文本以及其他与内容相关的来源学习的分类器。此外,基于标签的分类对于由于存储的资源的性质(例如,照片或视频的标签)而没有附加内容可用的民俗分类法变得至关重要。版权所有©2012 John Wiley&Sons,Ltd.

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号