首页> 外文会议>International Conference on Data Science >Functional Dependency Discovery on Distributed Database: Sampling Verification Framework
【24h】

Functional Dependency Discovery on Distributed Database: Sampling Verification Framework

机译:分布式数据库上的功能依赖性发现:采样验证框架

获取原文

摘要

In relational databases, functional dependencies discovery is a very important database analysis technology, which has a wide range of applications in knowledge discovery, database semantic analysis, data quality assessment and database design. The existing functional dependencies discovery algorithms are mainly designed for centralized data, which are usually only applicable when the data size is small. With the rapid development of the database scale of the times, the distributed environment function dependence discovery has more and more important practical significance. A functional dependencies discovery algorithm for big data in distributed environment is proposed. The basic idea is to first perform functional dependencies discovery on the sampled data set, and then globally verify the functional dependencies that may be globally established, so that all functional dependencies can be discovered. Parallel computing can be used to improve discovery efficiency while ensuring correctness.
机译:在关系数据库中,功能依赖关系发现是一个非常重要的数据库分析技术,它在知识发现,数据库语义分析,数据质量评估和数据库设计中具有广泛的应用。现有的功能依赖关系发现算法主要设计用于集中数据,这通常仅适用于数据大小小时。随着数据库规模的快速发展时代,分布式环境函数依赖发现具有越来越重要的实际意义。提出了一种分布式环境中大数据的功能依赖性发现算法。基本思想是首先在采样的数据集上执行功能依赖性发现,然后全局验证可以在全局建立的功能依赖关系,从而可以发现所有功能依赖性。并行计算可用于提高发现效率,同时确保正确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号