【24h】

Querying the World Wide Web

机译:查询万维网

获取原文

摘要

The World Wide Web is a large, heterogeneous, distributed collection of documents connected by hypertext links. The most common technology currently used for searching the Web depends on sending information retrieval requests to "index servers". One problem with this is that these queries cannot exploit the structure and topology of the document network. The authors propose a query language, WebSQL, that takes advantage of multiple index servers without requiring users to know about them, and that integrates textual retrieval with structure and topology-based queries. They give a formal semantics for WebSQL using a calculus based on a novel "virtual graph" model of a document network. They propose a new theory of query cost based on the idea of "query locality," that is, how much of the network must be visited to answer a particular query. Finally, they describe a prototype implementation of WebSQL written in Java.
机译:万维网是由超文本链接连接的大型异构,分布式的文件集合。目前用于搜索Web的最常用技术取决于向“索引服务器”发送信息检索请求。其中一个问题是,这些查询无法利用文档网络的结构和拓扑。作者提出了一种查询语言WebSQL,它利用多个索引服务器,而无需用户了解它们,并且将文本检索与基于结构和基于拓扑的查询集成。它们使用基于文档网络的新颖“虚拟图”模型来提供使用微积分的WebSQL的正式语义。它们提出了一种基于“查询局部性”的想法的查询成本的新理论,即必须访问多少网络以应答特定查询。最后,他们描述了在Java中编写的WebSQL的原型实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号