Complex Queries over Web Repositories

Abstract

Web repositories, such as the Stanford WebBase repository, manage large heterogeneous collections of Web pages and associated indexes. For effective analysis and mining, these repositories must provide a declarative query interface that supports complex, expressive Web queries. Such queries have two key characteristics: (ⅰ) They view a Web repository simultaneously as a collection of text documents, as a navigable directed graph, and as a set of relational tables storing properties of Web pages (length, URL, title, etc.). (ⅱ) They employ application-specific ranking and ordering relationships over pages and links to filter out and retrieve only the "best" query results. In this paper, we model a Web repository in terms of "Web relations" and describe an algebra for expressing complex Web queries. Our algebra extends traditional relational operators as well as graph navigation operators to uniformly handle plain, ranked, and ordered Web relations. In addition, we present an overview of the cost-based optimizer and execution engine that we have developed to efficiently execute Web queries over large repositories.

Bibliographic details

Source
Twenty-ninth International Conference on Very Large Databases; Sep 9-12, 2003; Berlin, Germany | 2003 | pp. 33-44 | 12 pages
Conference location: Berlin (DE)
Authors
Sriram Raghavan; Hector Garcia-Molina;
Author affiliation

Computer Science Department Stanford University Stanford, CA 94305, USA;

Original format: PDF
Text language: English
Chinese Library Classification: Automation technology, computer technology
Indexed: 2022-08-26 14:15:30

Similar literature

1. Query-Performance Prediction for Effective Query Routing in Domain-Specific Repositories [J]. Surendra Sarnikar, Zhu Zhang, J. Leon Zhao. Journal of the American Society for Information Science. 2014, No. 8
2. Special issue on querying the data web: Novel techniques for querying structured data on the web [J]. Paolo Ceravolo, Chengfei Liu, Mustafa Jarrar. World Wide Web. 2011, No. 5a6
3. PRISM: a web server and repository for prediction of protein–protein interactions and modeling their 3D complexes [J]. Alper Baspinar, Attila Gursoy, Engin Cukuroglu. Nucleic Acids Research. 2014, No. W1
4. Complex Queries over Web Repositories [C]. Sriram Raghavan, Hector Garcia-Molina. International Conference on Very Large Databases. 2003
5. The Web interfacing repository manager: A framework for developing Web-based experiment management systems [D]. Jakobovits, Rex Matthew. 1999
6. Facilitating Cohort Discovery by Enhancing Ontology Exploration Query Management and Query Sharing for Large Clinical Data Repositories [O]. Shiqiang Tao, Licong Cui, Xi Wu. 2017
7. Complex Queries over Web Repositories [O]. Sriram Raghavan Hector, Web Repositories 2003
