【24h】

Heterogenous Data Fusion via a Probabilistic Latent-Variable Model

机译:通过概率潜在变量模型进行异构数据融合

获取原文
获取原文并翻译 | 示例

摘要

In a pervasive computing environment, one is facing the problem of handling heterogeneous data from different sources, transmitted over heterogeneous channels and presented on heterogeneous user interfaces. This calls for adaptive data representations keeping as much relevant information as possible while keeping the representation as small as possible. Typically, the gathered data can be high-dimensional vectors with different types of attributes, e.g. continuous, binary and categorical data. In this paper we present - as a first step - a probabilistic latent-variable model, which is capable of fusing high-dimensional heterogenous data into a unified low-dimensional continuous space, and thus brings great benefits for multivariate data analysis, visualization and dimensionality reduction. We adopt a variational approximation to the likelihood of observed data and describe an EM algorithm to fit the model. The advantages of the proposed model are illustrated on toy data and used on real-world painting image data for both visualization and recommendation.
机译:在普适计算环境中,面临着处理来自不同源,通过异构通道传输并呈现在异构用户界面上的异构数据的问题。这要求自适应数据表示在保持表示尽可能小的同时保持尽可能多的相关信息。通常,收集的数据可以是具有不同类型属性的高维向量,例如属性。连续,二进制和分类数据。在本文中,我们首先提出一个概率潜变量模型,该模型能够将高维异构数据融合到统一的低维连续空间中,从而为多变量数据分析,可视化和维化带来巨大好处减少。我们对观测数据的可能性采用变分近似,并描述了适合模型的EM算法。所提出的模型的优点在玩具数据上说明,并在现实世界的绘画图像数据上用于可视化和推荐。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号