Journal: International Journal of Information Retrieval Research

Optimal Query Generation for Hidden Web Extraction through Response Analysis

Abstract

A huge number of Hidden Web databases exist on the WWW, forming a massive source of high-quality information. Retrieving this information to enrich the search engine's repository is the prime target of a Hidden Web crawler, which should also perform the task at an affordable cost and level of resource utilization. This paper proposes a Random ranking mechanism for ranking the queries to be issued by the Hidden Web crawler. By ranking queries according to this mechanism, the crawler can make an optimal choice among the candidate queries and retrieve Hidden Web databases efficiently. The proposed crawler also has an extensible and scalable framework to improve crawling efficiency. The approach is compared with other Hidden Web crawling methods from the literature.
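The abstract does not say how the ranking is computed, so the following is only a generic sketch of cost-aware query ranking, not the paper's mechanism. It assumes each candidate query's response (a set of record IDs) and its cost are already known or estimated. The names `rank_queries` and `crawl`, the score "new records per unit cost", and the budget parameter are all hypothetical, chosen for illustration.

```python
from typing import Dict, List, Set, Tuple


def rank_queries(
    responses: Dict[str, Set[int]],
    cost: Dict[str, float],
    seen: Set[int],
) -> List[Tuple[str, float]]:
    """Rank candidate queries by unseen records returned per unit cost.

    The scoring rule is an assumption made for this sketch.
    """
    scored = []
    for query, records in responses.items():
        new_records = len(records - seen)
        scored.append((query, new_records / cost[query]))
    # Highest score first; ties are broken alphabetically for determinism.
    return sorted(scored, key=lambda item: (-item[1], item[0]))


def crawl(
    responses: Dict[str, Set[int]],
    cost: Dict[str, float],
    budget: float,
) -> Tuple[List[str], Set[int]]:
    """Greedily issue the top-ranked affordable query until the budget runs out."""
    seen: Set[int] = set()
    issued: List[str] = []
    remaining = dict(responses)
    while remaining:
        # Only consider queries that still fit within the remaining budget.
        affordable = {q: r for q, r in remaining.items() if cost[q] <= budget}
        if not affordable:
            break
        best, score = rank_queries(affordable, cost, seen)[0]
        if score == 0:
            break  # No affordable query would return any new record.
        seen |= remaining.pop(best)
        budget -= cost[best]
        issued.append(best)
    return issued, seen
```

Re-ranking after every issued query matters in this sketch because a query's value drops once its records have already been retrieved by an earlier query.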