...
首页> 外文期刊>IEICE transactions on information and systems >The BINDS-Tree: A Space-Partitioning Based Indexing Scheme for Box Queries in Non-Ordered Discrete Data Spaces
【24h】

The BINDS-Tree: A Space-Partitioning Based Indexing Scheme for Box Queries in Non-Ordered Discrete Data Spaces

机译:BINDS树:一种基于空间分区的索引结构,用于非有序离散数据空间中的盒查询

获取原文
           

摘要

In recent years we have witnessed an increasing demand to process queries on large datasets in Non-ordered Discrete Data Spaces (NDDS). In particular, one type of query in an NDDS, called box queries, is used in many emerging applications including error corrections in bioinformatics and network intrusion detection in cybersecurity. Effective indexing methods are necessary for efficiently processing queries on large datasets in disk. However, most existing NDDS indexing methods were not designed for box queries. Several recent indexing methods developed for box queries on a large NDDS dataset in disk are based on the popular data-partitioning approach. Unfortunately, a space-partitioning based indexing scheme, which is more effective for box queries in an NDDS, has not been studied before. In this paper, we propose a novel indexing method based on space-partitioning, called the BINDS-tree, for supporting efficient box queries on a large NDDS dataset in disk. A number of effective strategies such as node split based on minimum span and cross optimal balance, redundancy reduction utilizing a singleton dimension inheritance property, and a space-efficient structure for the split history are incorporated in the constructing algorithm for the BINDS-tree. Experimental results demonstrate that the proposed BINDS-tree significantly improves the box query I/O performance, comparing to that of the state-of-the-artdata-partitioning based NDDS indexing method.
机译:近年来,我们见证了对在无序离散数据空间(NDDS)中对大型数据集进行查询的需求不断增长。特别是,NDDS中的一种查询(称为盒查询)被用于许多新兴应用中,包括生物信息学中的错误校正和网络安全中的网络入侵检测。有效的索引方法对于有效处理磁盘上大型数据集的查询是必需的。但是,大多数现有的NDDS索引方法都不是为盒查询设计的。针对流行的磁盘大NDDS数据集的盒式查询开发的几种最新索引方法是基于流行的数据分区方法。不幸的是,以前从未研究过基于空间分区的索引方案,该方案对于NDDS中的框查询更有效。在本文中,我们提出了一种基于空间划分的新索引方法,称为BINDS树,用于支持对磁盘上的大型NDDS数据集进行有效的盒查询。在BINDS树的构造算法中,结合了许多有效的策略,例如基于最小跨度和交叉最优平衡的节点拆分,利用单例维继承属性的冗余减少以及拆分历史的空间高效结构。实验结果表明,与基于最新数据分区的NDDS索引方法相比,所提出的BINDS树显着提高了盒查询I / O性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号