...
首页> 外文期刊>Knowledge and information systems >Extend tree edit distance for effective object identification
【24h】

Extend tree edit distance for effective object identification

机译:扩展树编辑距离以有效地识别对象

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Similarity join on XML documents which are usually modeled as rooted ordered labeled trees is widely applied, due to the ambiguity of references to the real-world objects. The conventional method dealing with this issue is based on tree edit distance, which is shortage of flexibility and efficiency. In this paper, we propose two novel edit operations together with extended tree edit distance, which can achieve good performance in similarity matching with hierarchical data structures [the run-time is in the worst case]. And then, we propose -generation set distance as a good approximation of the tree edit distance to further improve the join efficiency with quadric time complexity. Experiments on real and synthetic databases demonstrate the benefit of our method in efficiency and scalability.
机译:由于对现实世界对象的引用含混不清,因此通常将XML文档上的相似性联接(通常以根有序标记的树作为模型)进行广泛应用。解决该问题的常规方法是基于树的编辑距离,其缺乏灵活性和效率。在本文中,我们提出了两种新颖的编辑操作以及扩展的树形编辑距离,它们可以在与分层数据结构进行相似性匹配时获得良好的性能[运行时是最坏的情况]。然后,我们提出将生成集距离作为树编辑距离的一个很好的近似值,以进一步提高连接效率和二次时间复杂度。在真实和综合数据库上进行的实验证明了我们的方法在效率和可伸缩性方面的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号