Probabilistic and exact frequent subtree mining in graphs beyond forests

Welke Pascal; Horvath Tamas; Wrobel Stefan

首页> 外文期刊>Machine Learning >Probabilistic and exact frequent subtree mining in graphs beyond forests

【24h】

Probabilistic and exact frequent subtree mining in graphs beyond forests

机译：在森林之外的图表中的概率和精确频繁的子树挖掘

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Motivated by the impressive predictive power of simple patterns, we consider the problem of mining frequent subtrees in arbitrary graphs. Although the restriction of the pattern language to trees does not resolve the computational complexity of frequent subgraph mining, in a recent work we have shown that it gives rise to an algorithm generating probabilistic frequent subtrees, a random subset of all frequent subtrees, from arbitrary graphs with polynomial delay. It is based on replacing each transaction graph in the input database with a forest formed by a random subset of its spanning trees. This simple technique turned out to be quite powerful on molecule classification tasks. It has, however, the drawback that the number of sampled spanning trees must be bounded by a polynomial of the size of the transaction graphs, resulting in less impressive recall even for slightly more complex structures beyond molecular graphs. To overcome this limitation, in this work we propose an algorithm mining probabilistic frequent subtrees also with polynomial delay, but by replacing each graph with a forest formed by an exponentially large implicit subset of its spanning trees. We demonstrate the superiority of our algorithm over the simple one on threshold graphs used e.g. in spectral clustering. In addition, providing sufficient conditions for the completeness and efficiency of our algorithm, we obtain a positive complexity result on exact frequent subtree mining for a novel, practically and theoretically relevant graph class that is orthogonal to all graph classes defined by some constant bound on monotone graph properties.

机译：通过简单模式的令人印象深刻的预测力，我们认为在任意图中挖出频繁的子树的问题。虽然图案语言对树木的限制并不能解决频繁的子图挖掘的计算复杂性，但在最近的工作中，我们已经表明它产生了一种从任意图形产生概率频繁子树的算法，从任意图中占据所有频繁子树的随机子集具有多项式延迟。它基于将输入数据库中的每个交易图替换为具有由其生成树的随机子集形成的森林。这种简单的技术在分子分类任务上是非常强大的。然而，它具有所采样的跨越树的数量必须由交易图的大小的多项式限制的缺点，从而令人印象深刻的召回，即使对于超出分子图的稍微复杂的结构也甚至令人印象深刻。为了克服这一限制，在这项工作中，我们提出了一种算法挖掘概率常见的子树，也具有多项式延迟，而是通过用跨越树的指数大隐式子集形成的森林来替换每个图。我们通过例如在使用的阈值图上展示了我们算法的优越性。在光谱聚类中。此外，为我们的算法的完整性和效率提供足够的条件，我们获得了正常的复杂性导致精确的频繁的子树挖掘，几乎和理论上和理论上相关的图形类，其与单调上的某些常量绑定的所有图形类正交图形属性。

著录项

来源
《Machine Learning》 |2019年第7期|1137-1164|共28页
作者
Welke Pascal; Horvath Tamas; Wrobel Stefan;
展开▼
作者单位

Univ Bonn Bonn Germany;

Univ Bonn Bonn Germany|Fraunhofer IAIS Schloss Birlinghoven St Augustin Germany|Fraunhofer Ctr Machine Learning St Augustin Germany;

Univ Bonn Bonn Germany|Fraunhofer IAIS Schloss Birlinghoven St Augustin Germany|Fraunhofer Ctr Machine Learning St Augustin Germany;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Pattern mining; Frequent subgraph mining; Frequent subtree mining; Probabilistic patterns;

机译：模式挖掘;频繁的子图挖掘;频繁的子树挖掘;概率模式;

相似文献

外文文献
中文文献
专利

1. Probabilistic and exact frequent subtree mining in graphs beyond forests [J] . Welke Pascal, Horvath Tamas, Wrobel Stefan Machine Learning . 2019,第7期

机译：在森林以外的图中概率和精确的频繁子树挖掘
2. Probabilistic frequent subtrees for efficient graph classification and retrieval [J] . Pascal Welke, Tamás Horváth, Stefan Wrobel Machine Learning . 2018,第11期

机译：概率频繁子树用于有效的图分类和检索
3. Mining of closed frequent subtrees from frequently updated databases [J] . Viet Anh Nguyen, Akihiro Yamamoto Intelligent data analysis . 2012,第6期

机译：从频繁更新的数据库中挖掘封闭的频繁子树
4. Mining Subtrees with Frequent Occurrence of Similar Subtrees [C] . Hisashi Tosaka, Atsuyoshi Nakamura, Mineichi Kudo International Conference on Discovery Science(DS 2007); 20071001-04; Sendai(JP) . 2007

机译：挖掘频繁出现相似子树的子树
5. Efficient frequent pattern mining over probabilistic databases. [D] . Tong, Yongxin. 2013

机译：通过概率数据库进行有效的频繁模式挖掘。
6. FREQUENT SUBGRAPH MINING OF PERSONALIZED SIGNALING PATHWAY NETWORKS GROUPS PATIENTS WITH FREQUENTLY DYSREGULATED DISEASE PATHWAYS AND PREDICTS PROGNOSIS [O] . Arda Durmaz, Tim A. D. Henderson, Douglas Brubaker, -1

机译：频繁失调的疾病通路和预测的个性化信号通路网络组的频率子图挖掘
7. Probabilistic and exact frequent subtree mining in graphs beyond forests [O] . Pascal Welke, Tamás Horváth, Stefan Wrobel 2019

机译：在森林之外的图表中的概率和精确频繁的子树挖掘

Probabilistic and exact frequent subtree mining in graphs beyond forests

摘要

著录项

相似文献

相关主题

期刊订阅