首页> 外文期刊>Journal of Intelligent Information Systems >Unary And N-ary Inclusion Dependency Discovery In Relational Databases
【24h】

Unary And N-ary Inclusion Dependency Discovery In Relational Databases

机译:关系数据库中的一元和N元包含依赖项发现

获取原文
获取原文并翻译 | 示例
       

摘要

Foreign keys form one of the most fundamental constraints for relational databases. Since they are not always defined in existing databases, the discovery of foreign keys turns out to be an important and challenging task. The underlying problem is known to be the inclusion dependency (IND) inference problem. In this paper, data-mining algorithms are devised for IND inference in a given database. We propose a two-step approach. In the first step, unary INDs are discovered thanks to a new preprocessing stage which leads to a new algorithm and to an efficient implementation. In the second step, n-ary IND inference is achieved. This step fits in the framework of levelwise algorithms used in many data-mining algorithms. Since real-world databases can suffer from some data inconsistencies, approximate INDs, i.e. INDs which almost hold, are considered. We show how they can be safely integrated into our unary and n-ary discovery algorithms. An implementation of these algorithms has been achieved and tested against both synthetic and real-life databases. Up to our knowledge, no other algorithm does exist to solve this data-mining problem.
机译:外键构成关系数据库的最基本约束之一。由于并非总是在现有数据库中定义它们,因此外键的发现是一项重要且具有挑战性的任务。已知的基本问题是包含依赖性(IND)推断问题。在本文中,为给定数据库中的IND推理设计了数据挖掘算法。我们提出了两步法。第一步,由于新的预处理阶段而发现一元IND,这会导致新的算法和有效的实现。第二步,实现n元IND推断。此步骤适合于许多数据挖掘算法中使用的逐级算法框架。由于现实世界中的数据库可能会遇到一些数据不一致的问题,因此需要考虑近似的IND,即几乎保持的IND。我们展示了如何将它们安全地集成到我们的一元和n元发现算法中。这些算法的实现已实现,并已针对合成数据库和现实数据库进行了测试。据我们所知,没有其他算法可以解决此数据挖掘问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号