首页> 外文期刊>International journal of data mining, modelling and management >Approximating the covariance matrix of GMMs with low-rank perturbations
【24h】

Approximating the covariance matrix of GMMs with low-rank perturbations

机译:低秩扰动逼近GMM的协方差矩阵

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Covariance matrices capture correlations that are invaluable in modeling real-life datasets. Using all d~2 elements of the covariance (in d dimensions) is costly and could result in over-fitting; and the simple diagonal approximation can be over-restrictive. In this work, we present a new model, the low-rank Gaussian mixture model (LRGMM), for modeling data which can be extended to identifying partitions or overlapping clusters. The curse of dimensionality that arises in calculating the covariance matrices of the GMM is countered by using low-rank perturbed diagonal matrices. The efficiency is comparable to the diagonal approximation, yet one can capture correlations among the dimensions. Our experiments reveal the LRGMM to be an efficient and highly applicable tool for working with large high-dimensional datasets.
机译:协方差矩阵可捕获在对现实生活数据集进行建模时非常宝贵的相关性。使用协方差的所有d〜2元素(以d维为单位)是昂贵的,并且可能导致过度拟合。并且简单的对角线近似可能过于严格。在这项工作中,我们提出了一个新模型,即低阶高斯混合模型(LRGMM),用于对数据进行建模,该数据可以扩展为识别分区或重叠集群。通过使用低秩扰动对角矩阵可以抵消在计算GMM协方差矩阵时出现的维数诅咒。效率可与对角线近似相媲美,但可以捕获尺寸之间的相关性。我们的实验表明,LRGMM是处理大型高维数据集的有效且高度适用的工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号