【24h】

Anchor point indexing in Web document retrieval

机译:Web文档检索中的锚点索引

获取原文
获取原文并翻译 | 示例
           

摘要

Traditional World Wide Web search engines, such as AltaVista.com, index and recommend individual Web pages to assist users in locating relevant documents. As the Web grows, however, the number of matching pages increases at a tremendous rate. Users are often overwhelmed by the large answer set recommended by the search engines. Also, if a matching document is a hypertext, the document structure is destroyed and the individual pages that compose the document are returned instead. The logical starting point of the hyperdocument is thus hidden among the large basket of matching pages. Users need to spend a lot of effort browsing through the pages to locate the starting point, a very time consuming process. This paper studies the anchor point indexing problem. The set of anchor points of a given user query is a small set of key pages from which the larger set of documents that are relevant to the query can be easily reached. The use of anchor points helps solve the problems of huge answer set and low precision suffered by most search engines by considering the hyperlink structures of the relevant documents, and by providing a summary view of the result set.
机译:传统的万维网搜索引擎(例如AltaVista.com)会索引并推荐各个网页,以帮助用户查找相关文档。但是,随着Web的发展,匹配页面的数量正以惊人的速度增长。搜索引擎推荐的大型答案通常使用户不知所措。同样,如果匹配的文档是超文本,则文档结构将被破坏,而返回组成该文档的各个页面。因此,超文档的逻辑起点隐藏在大篮子的匹配页面中。用户需要花费大量精力浏览页面以找到起点,这是一个非常耗时的过程。本文研究了锚点索引问题。给定用户查询的定位点集是一小组关键页面,从中可以轻松找到与查询相关的较大文档集。锚点的使用通过考虑相关文档的超链接结构并提供结果集的摘要视图,有助于解决大多数搜索引擎所遇到的答案集庞大和精度低的问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号