首页> 外文会议>International conference on web information systems engineering >Discovering Functional Dependencies in Vertically Distributed Big Data
【24h】

Discovering Functional Dependencies in Vertically Distributed Big Data

机译:在垂直分布的大数据中发现功能依赖性

获取原文

摘要

The issue of discovering FDs has received a great deal of attention in the database research community. However, as the problem is exponential in the number of attributes, existing approaches can only be applied on small centralized datasets. It is challenging to discover FDs from big data, especially if data is distributed. We present a new algorithm DFDD for discovering all functional dependencies in parallel in vertically distributed big data following a breadth-first traversal strategy of the attribute lattice that combines efficient pruning. We verify experimentally that our approach can process distributed big datasets and it is scalable with the number of cluster nodes and the size of datasets.
机译:发现FD的问题在数据库研究界引起了广泛的关注。但是,由于问题是属性数量呈指数级增长,因此现有方法只能应用于小型集中式数据集。从大数据中发现FD非常具有挑战性,尤其是在数据分散的情况下。我们提出了一种新算法DFDD,它遵循结合了有效修剪的属性格的广度优先遍历策略,在垂直分布的大数据中并行发现所有功能依赖项。我们通过实验验证了我们的方法可以处理分布式大数据集,并且可以随着集群节点的数量和数据集的大小而扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号