首页> 外文会议>International Conference on Advanced Computing and Communicaitons >An Efficient LSI based Information Retrieval Framework using Particle swarm optimization and simulated annealing approach
【24h】

An Efficient LSI based Information Retrieval Framework using Particle swarm optimization and simulated annealing approach

机译:基于高效的LSI信息检索框架,使用粒子群优化和模拟退火方法

获取原文

摘要

The number of users and the amount of information available has exploded since the advent of the World Wide Web (WWW). Most of Web users use various search engines to get specific information. A key factor in the success of Web search engines are their ability to rapidly find good quality results to the queries that are based on specific terms. This paper aims at retrieving more relevant documents from a huge corpus based on the required information. We propose a text mining framework that consists of four distinct stages: 1. Text preprocessing 2. Dimesionality Reduction using Latent Semantic Indexing 3. Clustering based on Hybrid combination of Particle Swarm Optimization (PSO) and k-means Algorithm 4. Information Retrieval Process using Simulated Annealing (SA). This framework provides more relevant documents to the user and reduces the irrelevant documents.
机译:自世界范围(WWW)的出现以来,用户数量和可用信息数量已爆炸。大多数Web用户使用各种搜索引擎获取特定信息。 Web搜索引擎成功的关键因素是他们能够快速找到基于特定术语的查询的良好质量结果。本文旨在根据所需信息从庞大的语料库中检索更多相关文件。我们提出了一种由四个不同阶段组成的文本挖掘框架:1。文本预处理2.使用潜在语义索引的调调减少3.基于粒子群优化(PSO)和K-mean算法4的混合组合的聚类。信息检索过程使用模拟退火(SA)。该框架向用户提供了更多相关文档并减少了无关的文件。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号