【24h】

Collaborative Mining and Transfer Learning for Relational Data

机译:关系数据的协作挖掘和迁移学习

获取原文
获取原文并翻译 | 示例

摘要

Many of the real-world problems, - including human knowledge, communication, biological, and cyber network analysis, - deal with data entities for which the essential information is contained in the relations among those entities. Such data must be modeled and analyzed as graphs, with attributes on both objects and relations encode and differentiate their semantics. Traditional data mining algorithms were originally designed for analyzing discrete objects for which a set of features can be defined, and thus cannot be easily adapted to deal with graph data. This gave rise to the relational data mining field of research, of which graph pattern learning is a key sub-domain. In this paper, we describe a model for learning graph patterns in collaborative distributed manner. Distributed pattern learning is challenging due to dependencies between the nodes and relations in the graph, and variability across graph instances. We present three algorithms that trade-off benefits of parallelization and data aggregation, compare their performance to centralized graph learning, and discuss individual benefits and weaknesses of each model. Presented algorithms are designed for linear speedup in distributed computing environments, and learn graph patterns that are both closer to ground truth and provide higher detection rates than centralized mining algorithm.
机译:现实世界中的许多问题,包括人类知识,通讯,生物和网络分析,都涉及数据实体,这些实体之间的关系中包含了必不可少的信息。这样的数据必须以图表的形式进行建模和分析,对象和关系上的属性都对它们的语义进行编码和区分。传统的数据挖掘算法最初是为分析离散对象而设计的,可以为这些对象定义一组功能,因此不容易适应于处理图形数据。这引起了关系数据挖掘的研究领域,其中图模式学习是关键的子领域。在本文中,我们描述了一种以协作分布式方式学习图模式的模型。由于图中节点和关系之间的依赖性以及图实例之间的可变性,因此分布式模式学习具有挑战性。我们提出了三种算法,它们权衡了并行化和数据聚合的好处,将它们的性能与集中式图学习进行了比较,并讨论了每种模型的个别优点和缺点。提出的算法设计用于分布式计算环境中的线性加速,并且学习比集中挖掘算法更接近地面实况并提供更高检测率的图形模式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号