...
首页> 外文期刊>Data technologies and applications >A scalable eigenspace-based fuzzy c-means for topic detection
【24h】

A scalable eigenspace-based fuzzy c-means for topic detection

机译:一个可伸缩的eigenspace-based模糊c话题检测

获取原文
获取原文并翻译 | 示例
           

摘要

Purpose The aim of this research is to develop an eigenspace-based fuzzy c-means method for scalable topic detection. Design/methodology/approach The eigenspace-based fuzzy c-means (EFCM) combines representation learning and clustering. The textual data are transformed into a lower-dimensional eigenspace using truncated singular value decomposition. Fuzzy c-means is performed on the eigenspace to identify the centroids of each cluster. The topics are provided by transforming back the centroids into the nonnegative subspace of the original space. In this paper, we extend the EFCM method for scalability by using the two approaches, i.e. single-pass and online. We call the developed topic detection methods as oEFCM and spEFCM. Findings Our simulation shows that both oEFCM and spEFCM methods provide faster running times than EFCM for data sets that do not fit in memory. However, there is a decrease in the average coherence score. For both data sets that fit and do not fit into memory, the oEFCM method provides a tradeoff between running time and coherence score, which is better than spEFCM. Originality/value This research produces a scalable topic detection method. Besides this scalability capability, the developed method also provides a faster running time for the data set that fits in memory.
机译:目的本研究的目的是开发一个eigenspace-based模糊c均值方法可伸缩的话题检测。设计/方法/方法eigenspace-based模糊c均值(EFCM)结合表示学习和聚类。转化为低维特征空间用截断奇异值分解。在特征空间上执行模糊c确定每个集群的重心。主题提供的改变了质心到非负的子空间原来的空间。通过使用这两个方法的可伸缩性方法,即单次的和在线。发达oEFCM话题检测方法和spEFCM。oEFCM和spEFCM提供更快的方法运行时间比EFCM数据集不适合在内存中。相干平均分数。适合和不适合内存,oEFCM方法提供了一个运行时间之间的权衡和一致性评分,这是比spEFCM更好。创意/值研究产生可伸缩的话题检测方法。可扩展性功能,开发方法为数据集提供了更快的运行时间适合在内存中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号