【24h】

Multimodal factor analysis

机译:多峰因子分析

获取原文

摘要

A multimodal system with Poisson, Gaussian, and multinomial observations is considered. A generative graphical model that combines multiple modalities through common factor loadings is proposed. In this model, latent factors are like summary objects that has latent factor scores in each modality, and the observed objects are represented in terms of such summary objects. This potentially brings about a significant dimensionality reduction. It also naturally enables a powerful means of clustering based on a diverse set of observations. An expectation-maximization (EM) algorithm to find the model parameters is provided. The algorithm is tested on a Twitter dataset which consists of the counts and geographical coordinates of hashtag occurrences, together with the bag of words for each hashtag. The resultant factors successfully localizes the hashtags in all dimensions: counts, coordinates, topics. The algorithm is also extended to accommodate von Mises-Fisher distribution, which is used to model the spherical coordinates.
机译:考虑了具有泊松,高斯和多项观测的多模态系统。提出了一种通过公共因子加载结合多种方式的生成图形模型。在该模型中,潜在因子就像在每个模态中具有潜在因子分数的概要对象,并且观察到的对象以这种摘要对象表示。这可能带来了显着的维度减少。它还自然能够基于各种观察组件实现强大的聚类手段。提供了一个预期最大化(EM)算法来找到模型参数。该算法在Twitter数据集上进行了测试,该数据集由HashTag出现的计数和地理坐标组成,以及每个HASHTAG的单词袋。所产生的因素成功地定位了所有尺寸中的HashTags:计数,坐标,主题。该算法还扩展以适应Von Mises-Fisher分布,用于模拟球形坐标。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号