首页> 外文会议>IEEE Congress on Evolutionary Computation >Using Semantic Similarity Matrix for Defining Operations involved in NTSO for Clustering 20 News Groups
【24h】

Using Semantic Similarity Matrix for Defining Operations involved in NTSO for Clustering 20 News Groups

机译:使用语义相似性矩阵定义群体群集20个新闻组中涉及的操作

获取原文

摘要

In this research, we propose the similarity matrix based version of NTSO as the approach to the text clustering. For using one of traditional approaches to text clustering, documents should be encoded into numerical vectors; encoding so causes the two main problems: the huge dimensionality and the sparse distribution. In order to solve the problems, in this research, we propose to encode documents into string vectors and use the NTSO (Neural Text Self Organization) as the string vector based neural network for the text clustering. By encoding documents into another form, we attempt to avoid the two main problems, completely. As the empirical validation, the proposed approach will be compared with others with respect to the clustering performance and speed.
机译:在这项研究中,我们提出了基于Matrix的NTSO版本作为文本群集的方法。对于使用传统方法之一进行文本聚类,应将文档编码为数字向量;编码所以导致两个主要问题:巨大的维度和稀疏分布。为了解决问题,在本研究中,我们建议将文档编码为串向量,并使用NTSO(神经文本自组织)作为文本群集的基于串向量的神经网络。通过将文件编码为另一种形式,我们试图避免完全避免两个主要问题。作为经验验证,将与其他人相比,将拟议的方法与集群性能和速度进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号