首页> 外文学位 >Database design for spatial network management systems: Clustering and declustering techniques.
【24h】

Database design for spatial network management systems: Clustering and declustering techniques.

机译:空间网络管理系统的数据库设计:聚类和聚类技术。

获取原文
获取原文并翻译 | 示例

摘要

Spatial network databases are the kernel of many important applications, including transportation planning; water, electric and gas utilities; telephone networks; urban management; sewer maintenance, etc. The research question in this thesis concerns how to store and organize data in order to support efficient data access for spatial network databases. To address this question, first, clustering can be used to organize and store query-related objects together (e.g. into disk blocks) to reduce the access cost. Second, parallelism can be exploited to reduce the response time for querying over large sets of objects. This is achieved by declustering objects (e.g. disk-blocks) across multiple-disks, which can be accessed in parallel to reduce the response time for large queries.; This thesis shows that the effectiveness (i.e. I/O overhead) of clustering techniques for networks is directly dependent on the Weighted Connectivity Residue Ratio (WCRR), i.e., the chance that a pair of connected nodes that are more likely to be accessed together are allocated to a common page of the file. Based on this insight, this thesis proposes a connectivity-clustered access method (CCAM). The nodes of the network are clustered into disk pages, via a graph-partitioning, approach to maximize the WCRR. CCAM includes methods for static clustering, as well as for dynamic incremental reclustering, to maintain a high WCRR in the face of updates, without incurring high overheads. This thesis also describes possible modifications to improve the WCRR that can be achieved by existing spatial access methods. This thesis uses the spatial network data and network computation queries from the domain of Intelligent Vehicle Highway Systems (IVHS) to evaluate the proposed ideas.; In addressing declustering, this thesis proposes a new similarity-based declustering technique which is based on the max-cut partitioning of similarity graphs that model the data-sets and query-sets. This technique can adapt to available information about query distribution and can work with alternative data-types, data distributions, data sizes and partition-size constraints. Experiments in parallelizing Grid Files for spatial range queries show that this method outperforms traditional methods for several interesting query distributions, as well for several non-uniform data distributions.
机译:空间网络数据库是许多重要应用程序的核心,包括运输计划;水,电和煤气公用事业;电话网络;城市管理;本文的研究问题涉及如何存储和组织数据以支持对空间网络数据库的有效数据访问。为了解决这个问题,首先,可以使用群集将与查询相关的对象组织在一起并存储在一起(例如,放入磁盘块中),以降低访问成本。其次,可以利用并行性来减少用于查询大型对象集的响应时间。这是通过在多个磁盘上对对象(例如磁盘块)进行分簇来实现的,可以并行访问这些对象以减少大型查询的响应时间。该论文表明,网络集群技术的有效性(即I / O开销)直接取决于加权连接残差比(WCRR),即,一对更可能一起访问的连接节点的机会分配给文件的公共页面。基于这种见解,本文提出了一种连接集群访问方法(CCAM)。网络的节点通过图分区方法被群集到磁盘页面中,以最大化WCRR。 CCAM包括用于静态集群以及用于动态增量重新聚类的方法,以在更新时保持较高的WCRR,而不会产生高开销。本文还描述了可能通过现有空间访问方法实现的改进WCRR的修改。本文使用来自智能汽车公路系统(IVHS)领域的空间网络数据和网络计算查询来评估所提出的想法。为了解决聚类问题,本文提出了一种基于相似度的聚类新技术,该技术基于对数据集和查询集建模的相似度图的最大割分。此技术可以适应有关查询分布的可用信息,并且可以与其他数据类型,数据分布,数据大小和分区大小约束一起使用。针对空间范围查询并行化Grid File的实验表明,对于几种有趣的查询分布以及几种非均匀的数据分布,此方法优于传统方法。

著录项

  • 作者

    Liu, Duen-Ren.;

  • 作者单位

    University of Minnesota.;

  • 授予单位 University of Minnesota.;
  • 学科 Engineering Electronics and Electrical.; Computer Science.
  • 学位 Ph.D.
  • 年度 1995
  • 页码 132 p.
  • 总页数 132
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号