首页> 外文会议>SIAM International Conference on Data Mining >Graph Mining with Variational Dirichlet Process Mixture Models
【24h】

Graph Mining with Variational Dirichlet Process Mixture Models

机译:与变形型Dirichlet工艺混合模型的图形挖掘

获取原文

摘要

Graph data such as chemical compounds and XML documents are getting more common in many application domains. A main difficulty of graph data processing lies in the intrinsic high dimensionality of graphs, namely, when a graph is represented as a binary feature vector of indicators of all possible subgraph patterns, the dimensionality gets too large for usual statistical methods. We propose a nonparametric Bayesian method for clustering graphs and selecting salient patterns at the same time. Variational inference is adopted here, because sampling is not applicable due to extremely high dimensionality. The feature set minimizing the free energy is efficiently collected with the DFS code tree, where the generation of useless subgraphs is suppressed by a tree pruning condition. In experiments, our method is compared with a simpler approach based on frequent subgraph mining, and graph kernels.
机译:图诸如化学化合物和XML文档的图表数据在许多应用程序域中越来越常见。图数据处理的主要难度在于图形的内在高维度,即,当曲线图表示为所有可能的子图模式的指示器的二进制特征向量时,对于通常的统计方法,维度太大。我们提出了一种用于聚类图的非参数贝叶斯方法,并同时选择突出模式。此处采用变分推理,因为采样不适用于由于极高的维度。利用DFS码树有效地收集最小化自由能的特征集,其中通过树修剪条件抑制了无用子图的产生。在实验中,我们的方法与基于频繁的子图挖掘的更简单的方法进行比较,以及图形内核。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号