首页> 外文会议>International Conference on Computational Science and Its Applications >Table Based Single Pass Algorithm for Clustering News Articles in NewsPage.com
【24h】

Table Based Single Pass Algorithm for Clustering News Articles in NewsPage.com

机译:基于表基单通算法在NewsPage.com中群集新闻文章

获取原文

摘要

This research proposes a modified version of single pass algorithm specialized for text clustering. Encoding documents into numerical vectors for using the traditional version of single pass algorithm causes the two main problems: huge dimensionality and sparse distribution. Therefore, in order to address the two problems, this research modifies the single pass algorithm into its version where documents are encoded into other forms than numerical vectors. In the proposed version, documents are mapped into tables and an operation on two tables is defined for using the single pass algorithm. The goal of this research is to improve the performance of single pass algorithm for text clustering by modifying it.
机译:本研究提出了一种专门用于文本群集的单通算法的修改版本。将文档编码为使用传统版本的单通算法的数字矢量导致两个主要问题:巨大的维度和稀疏分布。因此,为了解决这两个问题,这项研究将单通算法修改为其版本,其中文档被编码成其他形式而不是数字向量。在所提出的版本中,文档被映射到表中,并为使用单通算法定义两个表上的操作。本研究的目标是通过修改它来改善文本聚类的单通算法的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号