首页> 外文会议>International Workshop on Knowledge Discovery from XML Documents >Information Retrieval from Distributed Semistructured Documents Using Metadata Interface
【24h】

Information Retrieval from Distributed Semistructured Documents Using Metadata Interface

机译:使用元数据接口从分布式半系统的信息检索

获取原文

摘要

We describe a method for retrieving information from distributed heterogeneous semistructured documents, and its implementation in the metadata interface DDXMI (Distributed Document XML Metadata Interface). The system generates local queries appropriate for local schemas from a user query over the global schema and shows the result of the generated queries. The three components are designed to generate the local queries: mappings between global schema and local schemas (extracted from local documents if not given), path substitution, and node identification for resolving the heterogeneity among nodes with the same label that often exist in semistructured data. The system uses Quilt as its XML query language. An experiment is reported over three local semistructured documents: ‘thesis’, ‘reports’, and ‘journal’ documents with ‘article’ global schema. The prototype was developed under Windows system with Java and JavaCC.
机译:我们介绍了一种从分布式异构半系统文档中检索信息的方法,以及它在元数据接口DDXMI(分布式文档XML元数据接口)中的实现。系统生成适合于全局架构的用户查询的本地查询,并显示生成查询的结果。这三个组件旨在生成本地查询:全局架构和本地模式之间的映射(如果未给出的本地文档提取),路径替换和节点标识,用于解决具有在半系统中通常存在的相同标签的节点之间的异构性。 。该系统使用被子作为其XML查询语言。在三种局部半系统的文件中报告了一个实验:“论文”,“报告”和“日记”文件,“文章”全球架构。使用Java和Javacc在Windows系统下开发了原型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号