首页> 外文期刊>International journal of knowledge and web intelligence >Incremental models for query clustering and query-context aware document clustering
【24h】

Incremental models for query clustering and query-context aware document clustering

机译:用于查询聚类和查询上下文感知的文档聚类的增量模型

获取原文
获取原文并翻译 | 示例
       

摘要

The traditional query clustering algorithms are designed to work on previously collected data from query stream. These algorithms become less and less effective with time because users' interests, query meaning and popularity of topics change over time. So, there is a need for incremental algorithms which can accommodate the concept drift that surface with new data being added to the collection without performing a complete re-clustering. We have proposed an incremental model for query and query-context aware document clustering. The model periodically updates new information efficiently and can be applied in a distributed environment. The proposed incremental model retains the quality of both query and document clusters. The proposed model can be applied to the results of hierarchical query clustering algorithms that produce query and document clusters. The model is tested on three hierarchical clustering algorithms on different datasets including TREC session track 2011 dataset. We have also experimented with the variant of the proposed incremental model for comparing the performance. The proposed model and its variant not only achieve accuracy very close to that of static models in all the experiments, but also offer a significant speedup.
机译:传统的查询聚类算法旨在处理先前从查询流中收集的数据。随着时间的流逝,这些算法的效率越来越低,因为用户的兴趣,查询的含义和主题的受欢迎程度都在变化。因此,需要一种增量算法,该算法可以适应表面的概念漂移,同时将新数据添加到集合中,而无需执行完整的重新聚类。我们提出了一种用于查询和支持查询上下文的文档聚类的增量模型。该模型可以定期有效地更新新信息,并且可以在分布式环境中应用。提议的增量模型保留了查询和文档集群的质量。所提出的模型可以应用于产生查询和文档聚类的分层查询聚类算法的结果。该模型在包括TREC会话跟踪2011数据集在内的不同数据集上的三种层次聚类算法上进行了测试。我们还对提议的增量模型的变体进行了实验,以比较性能。所提出的模型及其变体不仅在所有实验中都获得了与静态模型非常接近的精度,而且还提供了显着的加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号