首页> 外文会议>International Conference on Social Computing and Social Media;International Conference on Human-Computer Interaction >Verification of Probabilistic Latent Semantic Analysis Clustering Solution Stability and Proposal of Optimal Initial Values Setting Method
【24h】

Verification of Probabilistic Latent Semantic Analysis Clustering Solution Stability and Proposal of Optimal Initial Values Setting Method

机译:验证概率潜在语义分析聚类解决方案稳定性和最佳初始值设置方法的提案

获取原文

摘要

pLSA is a useful method to know the characteristics of customer or item in marketing. In this study, we proposed a method to set the initial values more efficiently than the existing method for the problem that the final solution depends on the initial values set in the EM algorithm used by pLSA to estimate the solutions. We focused on the dimensional compression and clustering that are the characteristics of pLSA, and thought that the stability of the solution of pLSA would be improved by reflecting it in the initial values. Therefore, first, we performed correspondence analysis and k-means cluster analysis on the original data to express the features of dimensional compression and clustering. Next, we compared the performance of the pLSA results with the initial values of the proposed method and the initial values of the conventional method using random numbers. As a result, it was shown that the proposed method also converges to the same log-likelihood as the conventional method, and that the proposed method is superior in terms of convergence speed and stability.
机译:PLSA是一个有用的方法,可以了解营销中的客户或项目的特征。在这项研究中,我们提出了一种方法来更有效地设置初始值,而不是现有方法,即最终解决方案取决于PLSA使用的EM算法中设置的初始值以估计解决方案。我们专注于尺寸压缩和聚类,这是PLSA的特征,并认为通过在初始值中反射它将改善PLSA溶液的稳定性。因此,首先,我们对原始数据进行了对应分析和K-Means群集分析,以表达尺寸压缩和聚类的特征。接下来,我们将PLSA结果与所提出的方法的初始值和使用随机数的传统方法的初始值进行了比较。结果,显示该方法也将其收敛于与传统方法相同的对数似然性,并且所提出的方法在收敛速度和稳定性方面优越。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号