首页> 外文会议>Conference of the Information Resources Management Association >Dynamic Indexing of Information in the Web: the Case of News Sites
【24h】

Dynamic Indexing of Information in the Web: the Case of News Sites

机译:网络中信息的动态索引:新闻网站的情况

获取原文

摘要

This paper presents a solution to keep available up-to-date information in a search engine whose scope is the content available within news web sites. This solution is based on the use of non-uniform policy to update the documents belonging to this scope. In order to use the non-uniform policy, we identify the most and the least recently updated documents, based on the idea in which it is supposed that the closest documents of the root of a site are the most modified ones. This hypothesis was verified through cm experiment within news sites. In order to demonstrate the efficiency of our solution regarding a traditional one, we performed a case study whose results showed that: our solution spent less time to make the new information available, it made fewer requests to the web server, it kept a high freshness of the scope and, finally, it kept the search engine index up-to-date for a much longer time than the traditional solution.
机译:本文介绍了一个解决方案,可以在搜索引擎中保留最新信息,其范围是新闻网站中可用的内容。此解决方案基于使用非统一策略来更新属于此范围的文档。为了使用非统一的策略,我们确定最新更新的文档,基于该想法,其中包含一个站点的根目录的最近文档是最修改的。通过新闻网站内的CM实验验证了该假设。为了展示我们对传统方法的解决方案的效率,我们进行了一个案例研究,其结果表明:我们的解决方案花费更少的时间来制作可用的新信息,它对Web服务器的要求较少,它保持高新度其中的范围,最后,它将搜索引擎索引保持最新的时间比传统解决方案更长的时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号