首页> 外文期刊>Information Processing & Management >A reliable FAQ retrieval system using a query log classification technique based on latent semantic analysis
【24h】

A reliable FAQ retrieval system using a query log classification technique based on latent semantic analysis

机译:使用基于潜在语义分析的查询日志分类技术的可靠FAQ检索系统

获取原文
获取原文并翻译 | 示例
           

摘要

To obtain high performances, previous works on FAQ retrieval used high-level knowledge bases or handcrafted rules. However, it is a time and effort consuming job to construct these knowledge bases and rules whenever application domains are changed. To overcome this problem, we propose a high-performance FAQ retrieval system only using users' query logs as knowledge sources. During indexing time, the proposed system efficiently clusters users' query logs using classification techniques based on latent semantic analysis. During retrieval time, the proposed system smoothes FAQs using the query log clusters. In the experiment, the proposed system outperformed the conventional information retrieval systems in FAQ retrieval. Based on various experiments, we found that the proposed system could alleviate critical lexical disagreement problems in short document retrieval. In addition, we believe that the proposed system is more practical and reliable than the previous FAQ retrieval systems because it uses only data-driven methods without high-level knowledge sources. (c) 2006 Elsevier Ltd. All rights reserved.
机译:为了获得较高的性能,以前有关FAQ检索的工作使用了高级知识库或手工制定的规则。但是,无论何时更改应用程序域,构造这些知识库和规则都是一项耗时且费力的工作。为了克服这个问题,我们提出了一种高性能的FAQ检索系统,该系统仅使用用户的查询日志作为知识源。在索引期间,所提出的系统使用基于潜在语义分析的分类技术有效地对用户的查询日志进行聚类。在检索期间,建议的系统使用查询日志群集对FAQ进行平滑处理。在实验中,提出的系统在FAQ检索中优于传统的信息检索系统。基于各种实验,我们发现所提出的系统可以缓解短文档检索中的关键词汇分歧问题。此外,我们认为,提出的系统比以前的FAQ检索系统更加实用和可靠,因为它仅使用数据驱动的方法,而没有高级知识来源。 (c)2006 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号