首页> 外文会议>Proceedings of the Tenth international conference on Information and knowledge management >Induction of integrated view for XML data with heterogeneous DTDs
【24h】

Induction of integrated view for XML data with heterogeneous DTDs

机译:利用异构DTD引入XML数据的集成视图

获取原文

摘要

This paper proposes a novel approach to integrating heterogeneous XML DTDs. With this approach, an information agent can be easily extended to integrate heterogeneous XML-based contents and perform federated search. Based on a tree grammar inference technique, this approach derives an integrated view of XML DTDs in an information integration framework. The derivation takes advantages of naming and structural similarities among DTDs in similar domains. The complete approach consists of three main steps. (1) DTD clustering clusters DTDs in similar domains into classes. (2) Schema learning applies a tree grammar inference technique to generate a set of tree grammar rules from the DTDs in a class from the previous step. (3) Minimization optimizes the rules generated in the previous step and transforms them into an integrated view. We have implemented the proposed approach into a system called DEEP and tested the system on artificial and real domains. The experimental results reveal that this system can effectively and efficiently integrate radically different DTDs.
机译:本文提出了一种集成异构XML DTD的新颖方法。通过这种方法,可以轻松地扩展信息代理以集成基于XML的异构内容并执行联合搜索。基于树语法推断技术,此方法在信息集成框架中派生XML DTD的集成视图。该推导利用了相似域中DTD之间的命名和结构相似性。完整的方法包括三个主要步骤。 (1) DTD聚类将相似域中的DTD聚类为类。 (2)模式学习应用树语法推断技术从上一步中的类中的DTD生成树语法规则集。 (3) Minimization 优化上一步中生成的规则,并将其转换为集成视图。我们已将提出的方法实施到名为 DEEP 的系统中,并在人工和真实域上对该系统进行了测试。实验结果表明,该系统可以有效,高效地集成根本不同的DTD。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号