首页> 外文会议>International Conference on Enterprise Information Systems >HIERARCHICAL MODEL-BASED CLUSTERING FOR RELATIONAL DATA
【24h】

HIERARCHICAL MODEL-BASED CLUSTERING FOR RELATIONAL DATA

机译:基于分层模型的关系数据群集

获取原文
获取外文期刊封面目录资料

摘要

Relational data mining deals with datasets containing multiple types of objects and relationships that are presented in relational formats, e.g. relational databases that have multiple tables. This paper proposes a propositional hierarchical model-based method for clustering relational data. We first define an object-relational star schema to model composite objects, and present a method of flattening composite objects into aggregate objects by introducing a new type of aggregates. frequency aggregate, which can be used to record not only the observed values but also the distribution of the values of an attribute. A hierarchical agglomerative clustering algorithm with log-likelihood distance is then applied to cluster the aggregated data tentatively. After stopping at a coarse estimate of the number of clusters, a mixture model-based method with the EM algorithm is developed to perform a further relocation clustering, in which Bayes Information Criterion is used to determine the optimal number of clusters. Finally we evaluate our approach on a real-world dataset.
机译:关系数据挖掘与包含多种类型的对象和关系以关系格式提出的数据集进行处理,例如,具有多个表的关系数据库。本文提出了一种用于聚类关系数据的主题基于模型的方法。我们首先将一个对象 - 关系之星架构定义为模型复合对象,并通过引入新类型的聚合来呈现将复合对象展平复合对象的方法。频率聚合,可用于不仅记录观察到的值,还可以用于录制属性值的分布。然后应用具有逻辑似然距离的分层凝聚聚类算法以暂时延长聚合数据。在停止在群集的粗略估计之后,开发了一种利用EM算法的基于混合模型的方法,以执行进一步的重定位聚类,其中使用贝叶信息标准来确定最佳簇数。最后,我们在真实世界数据集中评估我们的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号