首页> 外文会议>European conference on machine learning and principles and practice of knowledge discovery in databases >Vine Copulas for Mixed Data: Multi-view Clustering for Mixed Data Beyond Meta-Gaussian Dependencies
【24h】

Vine Copulas for Mixed Data: Multi-view Clustering for Mixed Data Beyond Meta-Gaussian Dependencies

机译:用于混合数据的Vine Copulas:超越元高斯依存关系的混合数据的多视图聚类

获取原文

摘要

Copulas enable flexible parameterization of multivariate distributions in terms of constituent marginals and dependence families. Vine copulas, hierarchical collections of bivariate copulas, can model a wide variety of dependencies in multivariate data including asymmetric and tail dependencies which the more widely used Gaussian copulas, used in Meta-Gaussian distributions, cannot. However, current inference algorithms for vines cannot fit data with mixed-a combination of continuous, binary and ordinal-features that are common in many domains. We design a new inference algorithm to fit vines on mixed data thereby extending their use to several applications. We illustrate our algorithm by developing a dependency-seeking multi-view clustering model based on Dirichlet Process mixture of vines that generalizes previous models to arbitrary dependencies as well as to mixed marginals. Empirical results on synthetic and real datasets demonstrate the performance on clustering single-view and multi-view data with asymmetric and tail dependencies and with mixed marginals.
机译:通过Copulas,可以根据组成边际和依赖项族灵活地对多元分布进行参数化。葡萄系动词是双变量系动词的分层集合,可以对多变量数据中的各种依赖性进行建模,包括不对称和尾部依赖性,在元高斯分布中使用的更广泛使用的高斯系动词不能。但是,当前针对藤蔓的推理算法无法将数据混合使用-在许多领域中常见的连续,二进制和有序特征的组合。我们设计了一种新的推理算法,以使藤蔓适合混合数据,从而将其用途扩展到多个应用程序。我们通过开发基于藤蔓的Dirichlet Process混合的依赖关系寻求多视图聚类模型来说明我们的算法,该模型将先前的模型推广到任意依赖关系以及混合边缘。综合和真实数据集上的经验结果表明,在对具有非对称和尾部相关性以及混合边际的单视图和多视图数据进行聚类时的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号