首页> 外文会议>International Conference on Large-Scale Knowledge Resources >Identification of MCMC Samples for Clustering
【24h】

Identification of MCMC Samples for Clustering

机译:用于聚类MCMC样本的识别

获取原文

摘要

For clustering problems, many studies use just MAP assignments to show clustering results instead of using whole samples from a MCMC sampler. This is because it is not straightforward to recognize clusters based on whole samples. Thus, we proposed an identification algorithm which constructs groups of relevant clusters. The identification exploits spectral clustering to group clusters. Although a naive spectral clustering algorithm is intractable due to memory space and computational time, we developed a memory-and-time efficient spectral clustering for samples of a MCMC sampler. In experiments, we show our algorithm is tractable for real data while the naive algorithm is intractable. For search query log data, we also show representative vocabularies of clusters, which cannot be chosen by just MAP assignments.
机译:对于群集问题,许多研究只使用地图分配来显示聚类结果,而不是使用MCMC采样器的整个样本。这是因为识别基于整个样本的群集并不直接。因此,我们提出了一种构建相关簇组的识别算法。该识别利用频谱聚类到组集群。尽管天真的光谱聚类算法由于存储空间和计算时间而难以相容,但是我们为MCMC采样器的样本开发了一个内存和时间高效的光谱聚类。在实验中,我们显示我们的算法对于实际数据而言是易于实际数据的,而Naive算法是棘手的。对于搜索查询日志数据,我们还显示了群集的代表性词汇表,只需映射分配就无法选择。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号