Venue: International Workshop on Database and Expert Systems Applications

Search Results Clustering without External Resources


Abstract

Our unsupervised Search Results Clustering (SRC) system partitions the top-n results returned by a search engine into clusters. We present experiments with this system, which performs incremental clustering on document titles and snippets only and uses no external resources, yet outperforms the best performers to date on the SemEval-2013 Task 11 gold standard. Latent Semantic Analysis (LSA) is included as an optional step, with the snippets themselves serving as the background corpus. We demonstrate that leaving the query terms out of the clustering process yields better results, and that the version without LSA currently outperforms the version with LSA.
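
The abstract does not give the algorithm's details, so the following is only a minimal sketch of what single-pass incremental clustering over snippets might look like once the query terms are excluded. The function names, the bag-of-words representation, the cosine similarity measure, and the `threshold` value are all assumptions made for illustration and are not taken from the paper. The optional LSA step is omitted.

```python
import math
import re
from collections import Counter


def tokenize(text, query_terms):
    """Lowercase the text, keep alphabetic tokens, and drop the query terms."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [t for t in tokens if t not in query_terms]


def cosine(a, b):
    """Cosine similarity between two term-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def incremental_cluster(snippets, query, threshold=0.2):
    """Assign each snippet, in order, to the most similar existing cluster,
    or open a new cluster when no similarity reaches the threshold.

    The threshold is a hypothetical value chosen for this example.
    """
    query_terms = set(re.findall(r"[a-z]+", query.lower()))
    clusters = []  # each cluster: {"centroid": Counter, "members": [indices]}
    for index, text in enumerate(snippets):
        vector = Counter(tokenize(text, query_terms))
        best, best_sim = None, 0.0
        for cluster in clusters:
            sim = cosine(vector, cluster["centroid"])
            if sim > best_sim:
                best, best_sim = cluster, sim
        if best is not None and best_sim >= threshold:
            best["members"].append(index)
            best["centroid"].update(vector)  # accumulate term counts
        else:
            clusters.append({"centroid": Counter(vector), "members": [index]})
    return [cluster["members"] for cluster in clusters]


if __name__ == "__main__":
    results = [
        "Jaguar car dealer: luxury cars and sedans",
        "The jaguar is a big cat native to the Americas",
        "Jaguar luxury cars for sale",
        "Jaguar cat habitat in the rainforest of the Americas",
    ]
    print(incremental_cluster(results, "jaguar"))
```

Dropping the query term matters in this toy setting because "jaguar" appears in every snippet and would otherwise pull all results toward one another regardless of sense. A real system would probably also need stop-word handling and a tuned threshold, neither of which is specified here.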
