首页> 美国卫生研究院文献>PLoS Clinical Trials >SOTXTSTREAM: Density-based self-organizing clustering of text streams
【2h】

SOTXTSTREAM: Density-based self-organizing clustering of text streams

机译:SOTXTSTREAM:基于密度的文本流自组织群集

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。
获取外文期刊封面目录资料

摘要

A streaming data clustering algorithm is presented building upon the density-based self-organizing stream clustering algorithm SOSTREAM. Many density-based clustering algorithms are limited by their inability to identify clusters with heterogeneous density. SOSTREAM addresses this limitation through the use of local (nearest neighbor-based) density determinations. Additionally, many stream clustering algorithms use a two-phase clustering approach. In the first phase, a micro-clustering solution is maintained online, while in the second phase, the micro-clustering solution is clustered offline to produce a macro solution. By performing self-organization techniques on micro-clusters in the online phase, SOSTREAM is able to maintain a macro clustering solution in a single phase. Leveraging concepts from SOSTREAM, a new density-based self-organizing text stream clustering algorithm, SOTXTSTREAM, is presented that addresses several shortcomings of SOSTREAM. Gains in clustering performance of this new algorithm are demonstrated on several real-world text stream datasets.
机译:在基于密度的自组织流聚类算法SOSTREAM的基础上,提出了一种流式数据聚类算法。许多基于密度的聚类算法因无法识别具有异构密度的聚类而受到限制。 SOSTREAM通过使用本地(基于最近邻居)密度确定来解决此限制。另外,许多流聚类算法使用两阶段聚类方法。在第一阶段,微集群解决方案保持联机状态,而在第二阶段,将微集群解决方案脱机集群化以生成宏解决方案。通过在联机阶段对微型集群执行自组织技术,SOSTREAM能够在单个阶段维护宏集群解决方案。利用SOSTREAM的概念,提出了一种新的基于密度的自组织文本流聚类算法SOTXTSTREAM,它解决了SOSTREAM的一些缺点。在一些实际的文本流数据集上证明了这种新算法的聚类性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号