首页> 外文会议>International conference on information and knowledge management >Induction of Integrated View for XML Data with Heterogeneous DTDs
【24h】

Induction of Integrated View for XML Data with Heterogeneous DTDs

机译:异构DTD的XML数据诱导综合视图

获取原文

摘要

This paper proposes a novel approach to integrating heterogeneous XML DTDs. With this approach, an information agent can be easily extended to integrate heterogeneous XML-based contents and perform federated search. Based on a tree grammar inference technique, this approach derives an integrated view of XML DTDs in an information integration framework. The derivation takes advantages of naming and structural similarities among DTDs in similar domains. The complete approach consists of three main steps. (1) DTD clustering clusters DTDs in similar domains into classes. (2) Schema learning applies a tree grammar inference technique to generate a set of tree grammar rules from the DTDs in a class from the previous step. (3) Minimization optimizes the rules generated in the previous step and transforms them into an integrated view. We have implemented the proposed approach into a system called DEEP and tested the system on artificial and real domains. The experimental results reveal that this system can effectively and efficiently integrate radically different DTDs.
机译:本文提出了一种集成异质XML DTD的新方法。通过这种方法,可以轻松扩展信息代理以集成基于异构的XML的内容并执行联合搜索。基于树语语法推理技术,该方法在信息集成框架中源于XML DTD的综合视图。衍生在类似域中的DTD之间的命名和结构相似性的优点。完整的方法包括三个主要步骤。 (1)DTD群集群集DTD在类似域中的类别。 (2)架构学习应用树语语法推理技术从上一步中从DTD生成一组树语语法规则。 (3)最小化优化上一步中生成的规则,将它们转换为集成视图。我们已经实施了提出的方法进入一个被称为深度并测试人工和真实域的系统的方法。实验结果表明,该系统能够有效地有效地整合到完全不同的DTD。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号