...
首页> 外文期刊>Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies >On the equivalence between nonnegative tensor factorization and tensorial probabilistic latent semantic analysis
【24h】

On the equivalence between nonnegative tensor factorization and tensorial probabilistic latent semantic analysis

机译:非负张量因子分解与张量概率潜在语义分析的等价性

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Non-negative Matrix Factorization (NMF) and Probabilistic Latent Semantic Analysis (PLSA) are two widely used methods for non-negative data decomposition of two-way data (e.g., document-term matrices). Studies have shown that PLSA and NMF (with the Kullback-Leibler divergence objective) are different algorithms optimizing the same objective function. Recently, analyzing multi-way data (i.e., tensors), has attracted a lot of attention as multi-way data have rich intrinsic structures and naturally appear in many real-world applications. In this paper, the relationships between NMF and PLSA extensions on multi-way data, e.g., NTF (Non-negative Tensor Factorization) and T-PLSA (Tensorial Probabilistic Latent Semantic Analysis), are studied. Two types of T-PLSA models are shown to be equivalent to two well-known non-negative factorization models: PARAFAC and Tucker3 (with the KL-divergence objective). NTF and T-PLSA are also compared empirically in terms of objective functions, decomposition results, clustering quality, and computation complexity on both synthetic and real-world datasets. Finally, we show that a hybrid method by running NTF and T-PLSA alternatively can successfully jump out of each other's local minima and thus be able to achieve better clustering performance.
机译:非负矩阵因式分解(NMF)和概率潜在语义分析(PLSA)是两种广泛使用的双向数据(例如文档项矩阵)非负数据分解方法。研究表明PLSA和NMF(具有Kullback-Leibler发散目标)是优化同一目标函数的不同算法。近来,由于多路数据具有丰富的固有结构并且自然地出现在许多实际应用中,因此分析多路数据(即张量)引起了很多关注。在本文中,研究了NMF和PLSA扩展在多路数据上的关系,例如NTF(非负张量因式分解)和T-PLSA(张量概率潜在语义分析)。已显示两种类型的T-PLSA模型等效于两种众所周知的非负因子分解模型:PARAFAC和Tucker3(具有KL散度目标)。还对合成和真实数据集的目标函数,分解结果,聚类质量和计算复杂度进行了经验比较,比较了NTF和T-PLSA。最后,我们证明了通过运行NTF和T-PLSA的混合方法可以成功跳出彼此的局部最小值,从而能够实现更好的聚类性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号