首页> 外文会议>IEEE/ACIS International Conference on Computer and Information Science >Web and Document Databases: An Effective Way to Explore the Internet
【24h】

Web and Document Databases: An Effective Way to Explore the Internet

机译:Web和文档数据库:探索互联网的有效方式

获取原文

摘要

In this paper, we discuss the architecture of a system, the so-called Web and Document Databases (WDDBS for short), designed to explore the Internet effectively and efficiently. Abstractly, a WDDBS can be defined as a triple, where (1) D stands for a local docu¬ment database to store XML documents, (2) P for a subsystem responsible for remote query evaluation, including resolution of semantic conflicts among heterogeneous databases, and (3) W for a Web crawler which should be able to find information sources related to the local database in some way. Then, each information source can be organized into a WDDB distributed over the Internet, which may be con¬nected to others through URLs. A query submitted to a WDDBS will first be evaluated against the local document database, and then possibly switched over to some remote document databases if necessary, which is controlled by the ‘knowledge’ on how local WDDBSs are connected. In this way, the load of traffic over the Internet can effectively be decreased, but the information explored is more relevant.
机译:在本文中,我们讨论了系统的架构,所谓的Web和文档数据库(简称WDDB),旨在有效探索互联网。禁止,WDDB可以定义为三倍,其中(1)d代表本地文档数据库,用于存储XML文档,(2)对于负责远程查询评估的子系统,包括异构数据库中的语义冲突的分辨率(3)对于Web爬网程序的W,这应该能够以某种方式找到与本地数据库相关的信息源。然后,可以将每个信息源组织到分布在因特网上的WDDB中,这可以通过URL被连接到其他人。首先将根据本地文档数据库评估提交给WDDB的查询,然后如果需要,可能会转换为某些远程文档数据库,该数据库由“知识”控制本地WDDBS的连接。通过这种方式,可以有效地减少互联网上的流量负荷,但探索的信息更为相关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号