【24h】

Optimised K-means for web search

机译:针对网络搜索的优化K均值

获取原文
获取原文并翻译 | 示例
       

摘要

With the vast amount of information available online, searching documents relevant to a given query requires the user to go through many titles and snippets. This searching time can be reduced by grouping search results into clusters so that the user can select the relevant cluster at a glance by looking at the cluster labels. A new method of search results clustering is introduced in this paper which clusters the search results using optimised K-means algorithm using the terms from URL, title tag and meta tag as features. Optimisation of K-means algorithm is done by selecting the initial centroids using scale factor method. The proposed method of clustering is compared with existing snippet clustering algorithms in terms of intra-cluster distance and inter-cluster distance. Results show that the proposed method produces high quality clusters than the existing methods.
机译:借助在线上可用的大量信息,搜索与给定查询相关的文档需要用户浏览许多标题和摘要。通过将搜索结果分组到集群中,可以减少搜索时间,以便用户可以通过查看集群标签一目了然地选择相关集群。本文提出了一种新的搜索结果聚类方法,该方法以URL,标题标签和元标签中的术语为特征,使用优化的K-means算法对搜索结果进行聚类。通过使用比例因子方法选择初始质心来完成K-means算法的优化。在集群内距离和集群间距离方面,将所提出的聚类方法与现有的摘要聚类算法进行了比较。结果表明,与现有方法相比,所提出的方法能够产生高质量的簇。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号