【24h】

On Extracting a Database Schema from Semistructured Documents

机译:从半结构化文档中提取数据库模式

获取原文
获取原文并翻译 | 示例

摘要

In semistructured data, the data structure is irregular and no explicit database schema is given, which causes several problems such as inefficient data retrieval and wasteful data storage. To cope with such problems, some algorithms extracting database schema from semistructured data have been proposed, in which data is modeled as an unordered tree. However, the order of elements is indispensable for document data, therefore we model data as an ordered tree and consider a problem of extracting an optimum database schema from semistructured data. We first show that the corresponding decision problem is strongly NP-complete. We next propose a polynomial-time algorithm for extracting a database schema.
机译:在半结构化数据中,数据结构是不规则的,没有给出明确的数据库架构,这会导致一些问题,例如数据检索效率低下和数据存储浪费。为了解决这些问题,已经提出了一些从半结构化数据中提取数据库模式的算法,其中将数据建模为无序树。但是,元素的顺序对于文档数据是必不可少的,因此我们将数据建模为有序树,并考虑了从半结构化数据中提取最佳数据库模式的问题。我们首先表明,相应的决策问题是强烈的NP完全问题。接下来,我们提出用于提取数据库模式的多项式时间算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号