首页> 外文会议>International conference on database and expert systems applications >Data Value Storage for Compressed Semi-structured Data
【24h】

Data Value Storage for Compressed Semi-structured Data

机译:压缩半结构化数据的数据值存储

获取原文

摘要

Growing user expectations of anywhere, anytime access to information require new types of data representations to be considered. While semi-structured data is a common exchange format, its verbose nature makes files of this type too large to be transferred quickly, especially where only a small part of that data is required by the user. There is consequently a need to develop new models of data storage to support the sharing of small segments of semi-structured data since existing XML compressors require the transfer of the entire compressed structure as a single unit. This paper examines the potential for bisimilarity-based partitioning (i.e. the grouping of items with similar structural patterns) to be combined with dictionary compression methods to produce a data storage model that remains directly accessible for query processing whilst facilitating the sharing of individual data segments. Study of the effects of differing types of bisimilarity upon the storage of data values identified the use of both forwards and backwards bisimilarity as the most promising basis for a dictionary-compressed structure. A query strategy is detailed that takes advantage of the compressed structure to reduce the number of data segments that must be accessed (and therefore transferred) to answer a query. A method to remove redundancy within the data dictionaries is also described and shown to have a positive effect on memory usage.
机译:用户对随时随地访问信息的期望越来越高,这需要考虑新型的数据表示形式。尽管半结构化数据是一种常见的交换格式,但其冗长的性质使该类型的文件太大而无法快速传输,尤其是在用户只需要该数据的一小部分的情况下。因此,由于现有的XML压缩程序需要将整个压缩结构作为单个单元进行传输,因此需要开发新的数据存储模型以支持小部分半结构化数据的共享。本文研究了基于双相似度的分区(即具有相似结构模式的项目的分组)与字典压缩方法相结合以产生可直接访问以进行查询处理,同时促进各个数据段共享的数据存储模型的潜力。对不同类型的双相似性对数据值存储的影响的研究表明,正向和反向双相似性都是字典压缩结构最有希望的基础。详细介绍了一种查询策略,该策略利用压缩的结构来减少必须访问(并因此传输)的数据段以回答查询。还描述了一种删除数据字典内冗余的方法,该方法显示出对内存使用有积极影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号