...
首页> 外文期刊>Journal of intelligent & fuzzy systems: Applications in Engineering and Technology >Indexing through separable partitioning for complex data sharing in P2P systems
【24h】

Indexing through separable partitioning for complex data sharing in P2P systems

机译:通过可分离分区索引P2P系统中的复杂数据共享

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Similarity search for content-based retrieval - a sustained problem; many applications endures. Most of the similarity measures intend focusing the least possible set of elements to find an answer. In the literature, most work is based on splitting the target data set into subsets using balls. However, in the era of big data, where efficient indexing is of vital importance, the subspace volumes grow exponentially, which could degenerate the index. This problem arises due to inherent insufficiency of space partitioning interlaced with the overlap factor among the regions. This affects the search algorithms thereby rendering these methods ineffective as it gets hard to store, manage and analyze the aforementioned quantities. A good topology should avoid biased allocation of objects for separable sets and should not influence the structure of the index. We put-forward a novel technique for indexing; IMB-tree, which limits the volume space, excludes the empty sets; the separable partitions, does not contain objects and creates eXtended regions that will be inserted into a new index named eXtended index, implemented in a P2P environment. These can reunite all objects in one of the subsets-partitions; either in a separable set or in the exclusion set, keeping the others empty. We also discussed the efficiency of construction and search algorithms, as well as the quality of the index. The experimental results show interesting performances.
机译:相似性搜索基于内容的检索 - 持续的问题;许多应用程序都持久。大多数相似性措施都设计专注于找到答案的最少可能的元素集。在文献中,大多数作品都基于使用球将目标数据分成亚包。然而,在大数据的时代,有效索引至关重要的情况下,子空间量呈指数级增长,这可能脱索。由于空间分区的固有不足因区域之间的重叠因子而存在,因此出现了这个问题。这会影响搜索算法,从而使这些方法无效,因为它难以存储,管理和分析上述量。良好的拓扑应避免偏见分配可分离集的对象,不应影响指数的结构。我们提出了一种用于索引的新技术; IMB-Tree限制卷空间,排除了空集;可分离分区不包含对象,并创建将插入名为Extended索引的新索引的扩展区域,该索引在P2P环境中实现。这些可以在其中一个子集分区中重聚所有对象;无论是可分离的集合还是排除集合,都将其他物体保持为空。我们还讨论了施工和搜索算法的效率,以及索引的质量。实验结果表明有趣的表现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号