首页> 外文学位 >A performance study of XML query optimization techniques.
【24h】

A performance study of XML query optimization techniques.

机译:XML查询优化技术的性能研究。

获取原文
获取原文并翻译 | 示例

摘要

As computers and technology continue to become more commonplace and essential to everyday life, more data is captured, stored, and analyzed by a variety of institutions in government, education, and the private sector. As this amount of data grows, so does the need for efficient methodologies and tools used to store, retrieve, and transform the data. A common method used to store this schemaless, semi-structured data is through the Extensible Markup Language, XML. In this way, an XML document is viewed as a database. With this sizable amount of data stored in a common format, one problem is how to efficiently query XML documents. While relational database management systems contain built-in query optimizers, no such framework exists for XML databases. A multitude of document shapes, query shapes, index structures, and query techniques exist for XML databases, but the implications of these choices and their effects on query processing have not been investigated in a common framework. This dissertation identifies a set of representative query techniques, document structures, and query styles for XML databases and provides a common framework for classifying the various query techniques, structures, and styles. We identify two broad classifications of query techniques, native XML and non-native XML, and develop a cost-based model for each technique that models query performance from an execution standpoint. We also develop our own query technique, RDBQuery, as an extension and major enhancement to a previously existing non-native XML query technique that leverages a relational database management system to efficiently process XML queries. To evaluate relative query performance, we compare the techniques for various parameters that impact their performance, including query shape and document shape/size, and the results are presented through a series of graphs. These graphs and their underlying cost models are used to present an optimization framework for XML queries, and this provides the essential foundation in development of an integrated cost-based XML query optimizer.
机译:随着计算机和技术继续变得越来越普遍和对日常生活至关重要,越来越多的数据被政府,教育和私营部门的各种机构捕获,存储和分析。随着数据量的增长,对用于存储,检索和转换数据的有效方法和工具的需求也在增加。存储这种无模式,半结构化数据的常用方法是通过可扩展标记语言XML。这样,可以将XML文档视为数据库。使用通用格式存储的大量数据中,一个问题是如何有效地查询XML文档。虽然关系数据库管理系统包含内置的查询优化器,但XML数据库不存在这样的框架。 XML数据库存在大量文档形状,查询形状,索引结构和查询技术,但是尚未在一个通用框架中研究这些选择的含义及其对查询处理的影响。本文为XML数据库确定了一组代表性的查询技术,文档结构和查询样式,并提供了用于分类各种查询技术,结构和样式的通用框架。我们确定了查询技术的两种大致分类,即本地XML和非本地XML,并针对从执行角度对查询性能建模的每种技术,开发了一种基于成本的模型。我们还开发了自己的查询技术RDBQuery,作为对以前存在的非本地XML查询技术的扩展和主要增强,该技术利用关系数据库管理系统来有效处理XML查询。为了评估相对查询性能,我们比较了影响其性能的各种参数的技术,包括查询形状和文档形状/大小,并通过一系列图形显示了结果。这些图及其底层成本模型用于提供XML查询的优化框架,这为开发基于成本的集成XML查询优化器提供了必要的基础。

著录项

  • 作者单位

    University of Cincinnati.;

  • 授予单位 University of Cincinnati.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 279 p.
  • 总页数 279
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号