Conference: International Conference on Software Engineering and Data Engineering

METRIZED SMALL WORLD PROPERTIES BASED DATA STRUCTURE

Abstract

We introduce an information-retrieval-oriented data structure for building very large, scalable, distributed storage of loosely structured and unstructured data. The main idea is to represent the data as a set of structured storage units on which a semi-metric can be defined that characterizes the relative relevance of the units. A complex graph is then constructed whose vertices are the storage units and whose edges are selected so that the graph has small-world properties and is consistent with the introduced metric (the Metrized Small World Feature). Adding and removing data items causes the graph to evolve. Information is retrieved by generating a new vertex for the request, connecting it to the graph, and searching for the data vertices that are metrically close to the request vertex. Owing to the special properties of the constructed graph, the search completes, on average, in a number of steps logarithmic in the storage size. We built a prototype of such a storage in which the data items are XML documents and the graph is expressed by means of XLink. Our analysis of the graph properties confirmed the possibility of building efficient XML data stores containing hundreds of petabytes of data.
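
The retrieval idea described in the abstract (connect a request vertex to the graph, then walk toward metrically closer vertices) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the authors' construction: the class name `MetrizedSmallWorld`, the parameters `k` and `attempts`, and the rule of linking each new vertex to the closest vertices found by a multi-start greedy search are all hypothetical. Removal of data items and the XML/XLink representation are not modeled, and the sketch makes no claim about reproducing the logarithmic search cost.

```python
import random
from typing import Any, Callable, Dict, List, Set


class MetrizedSmallWorld:
    """Toy graph of storage units linked according to a semi-metric (illustrative only)."""

    def __init__(self, distance: Callable[[Any, Any], float],
                 k: int = 3, attempts: int = 3, seed: int = 0) -> None:
        self.distance = distance      # semi-metric between two data items
        self.k = k                    # links created per inserted vertex (assumed)
        self.attempts = attempts      # greedy restarts per search (assumed)
        self.rng = random.Random(seed)
        self.items: Dict[int, Any] = {}
        self.neighbors: Dict[int, Set[int]] = {}

    def greedy_search(self, query: Any, start: int) -> int:
        """Walk to the neighbor closest to `query` until no neighbor is closer."""
        current = start
        current_dist = self.distance(query, self.items[current])
        while True:
            best, best_dist = current, current_dist
            for n in self.neighbors[current]:
                d = self.distance(query, self.items[n])
                if d < best_dist:
                    best, best_dist = n, d
            if best == current:
                return current        # local minimum of the distance to the query
            current, current_dist = best, best_dist

    def search(self, query: Any, k: int = 1) -> List[int]:
        """Return up to k vertex ids found near `query` (approximate, not exact)."""
        if not self.items:
            return []
        candidates: Set[int] = set()
        for _ in range(self.attempts):
            start = self.rng.randrange(len(self.items))
            hit = self.greedy_search(query, start)
            candidates.add(hit)
            candidates.update(self.neighbors[hit])
        ranked = sorted(candidates,
                        key=lambda v: (self.distance(query, self.items[v]), v))
        return ranked[:k]

    def add(self, item: Any) -> int:
        """Insert a data item: search for nearby vertices, then link to them."""
        close = self.search(item, self.k)
        vertex = len(self.items)      # ids are 0..n-1; removal is not modeled here
        self.items[vertex] = item
        self.neighbors[vertex] = set(close)
        for v in close:
            self.neighbors[v].add(vertex)   # edges are undirected
        return vertex
```

With numeric items and `distance=lambda a, b: abs(a - b)`, calling `add` for each item and then `search(query, k)` returns the ids of nearby vertices. The greedy walk can stop at a local minimum, so the result is approximate; restarting from several entry points (`attempts`) is one simple way to reduce that risk in this sketch.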