首页> 外文期刊>系统工程与电子技术(英文版) >Tag clustering algorithm LMMSK:improved K-means algorithm based on latent semantic analysis
【24h】

Tag clustering algorithm LMMSK:improved K-means algorithm based on latent semantic analysis

机译:标签聚类算法LMMSK:基于潜在语义分析的改进K-means算法

获取原文
获取原文并翻译 | 示例
       

摘要

With the wide application of Web 2.0 and social soft-ware, there are more and more tag-related studies and applica-tions. Because of the randomness and the personalization in users' tagging, tag research continues to encounter data space and se-mantics obstacles. With the min-max similarity (MMS) to establish the initial centroids, the traditional K-means clustering algorithm is firstly improved to the MMSK-means clustering algorithm, the superiority of which has been tested; based on MMSK-means and combined with latent semantic analysis (LSA), here secondly emerges a new tag clustering algorithm, LMMSK. Finally, three al-gorithms for tag clustering, MMSK-means, tag clustering based on LSA (LSA-based algorithm) and LMMSK, have been run on Mat-lab, using a real tag-resource dataset obtained from the Delicious Social Bookmarking System from 2004 to 2009. LMMSK's cluster-ing result turns out to be the most effective and the most accurate. Thus, a better tag-clustering algorithm is found for greater appli-cation of social tags in personalized search, topic identification or knowledge community discovery. In addition, for a better compar-ison of the clustering results, the clustering corresponding results matrix (CCR matrix) is proposed, which is promisingly expected to be an effective tool to capture the evolutions of the social tagging system.
机译:随着Web 2.0和社交软件的广泛应用,越来越多的标签相关的研究和应用。由于用户标签的随机性和个性化,标签研究继续遇到数据空间和语义障碍。利用最小-最大相似度(MMS)建立初始质心,首先将传统的K-means聚类算法改进为MMSK-means聚类算法,并验证了其优越性。基于MMSK-means算法,并结合潜在语义分析(LSA),提出了一种新的标签聚类算法LMMSK。最后,使用从Delicious Delicious Social Bookmarking System获得的真实标签资源数据集,在Mat-lab上运行了三种标签聚类算法:MMSK-均值,基于LSA(基于LSA的算法)和LMMSK的标签聚类。从2004年到2009年。LMMSK的聚类结果被证明是最有效和最准确的。因此,找到了一种更好的标签聚类算法,可以在个性化搜索,主题识别或知识社区发现中更好地应用社交标签。此外,为了更好地比较聚类结果,提出了聚类对应的结果矩阵(CCR矩阵),有望被期望成为捕捉社会标签系统演变的有效工具。

著录项

  • 来源
    《系统工程与电子技术(英文版)》 |2017年第2期|374-384|共11页
  • 作者

    Jing Yang; Jun Wang;

  • 作者单位

    School of Economics and Management, Beihang University, Beijing 100191, China;

    School of Economics and Management, Beihang University, Beijing 100191, China;

  • 收录信息 中国科学引文数据库(CSCD);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 13:04:46
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号