SM based Operation for Specializing a Fast Clustering Algorithm for Text Clustering

机译：用于文本聚类的快速聚类算法的基于SM的操作

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This research proposes a new strategy where documents are encoded into string vectors for text clustering and modified versions of single pass algorithms to be adaptable to string vectors. Traditionally, when the single pass algorithm is used for pattern clustering, raw data should be encoded into numerical vectors. This encoding may be difficult, depending on a given application area of pattern clustering. For example, in text clustering, encoding full texts given as raw data into numerical vectors leads to two main problems: huge dimensionality and sparse distribution. In order to address the two problems, in this research, we encode full texts into string vectors, and apply single pass algorithm to string vectors for text clustering.

机译：这项研究提出了一种新的策略，其中将文档编码为字符串向量以进行文本聚类，并修改单次通过算法的版本以适应字符串向量。传统上，当将单遍算法用于模式聚类时，应将原始数据编码为数值向量。取决于模式聚类的给定应用领域，这种编码可能很困难。例如，在文本聚类中，将作为原始数据给出的全文编码为数值向量会导致两个主要问题：巨大的维数和稀疏的分布。为了解决这两个问题，在本研究中，我们将全文编码为字符串向量，并将单遍算法应用于字符串向量以进行文本聚类。

著录项

来源
《Proceedings of the 2007 International Conference on Artificial Intelligence(ICAI'2007)》|2007年|P.777780|共2页
会议地点
作者
Taeho Jo; Malrey Lee;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. Semantic string operation for specializing AHC algorithm for text clustering [J] . Jo Taeho Annals of Mathematics and Artificial Intelligence . 2020,第10期

机译：专用AHC算法的语义字符串操作
2. Improved fast partitional clustering algorithm for text clustering [J] . Bejos Sebastian, Feliciano-Avelino Ivan, Martinez-Trinidad J. Fco., Journal of intelligent & fuzzy systems: Applications in Engineering and Technology . 2020,第2Pta2期

机译：改进的文本群集快速分区聚类算法
3. Probability-based text clustering algorithm by alternately repeating two operations [J] . Ming Liu, Yuanchao Liu, Bingquan Liu, Journal of Information Science . 2013,第3期

机译：通过交替重复两个操作的基于概率的文本聚类算法
4. SM based Operation for Specializing a Fast Clustering Algorithm for Text Clustering [C] . Taeho Jo, Malrey Lee International Conference on Artificial Intelligence . 2007

机译：基于SM基于文本群集的快速聚类算法的SM
5. Novel approaches to clustering, biclustering algorithms based on adaptive resonance theory and intelligent control. [D] . Kim, Sejun. 2016

机译：基于自适应共振理论和智能控制的新型聚类，双聚类算法。
6. Shrinkage Clustering: a fast and size-constrained clustering algorithm for biomedical applications [O] . Chenyue W. Hu, Hanyang Li, Amina A. Qutub 2018

机译：收缩聚类：用于生物医学应用的快速且受大小限制的聚类算法
7. Design and Application of a Text Clustering Algorithm Based on Parallelized K-Means Clustering [O] . Hui Wang, Chengdong Zhou, Leixiao Li 2019

机译：基于并行k均值聚类的文本聚类算法的设计与应用

SM based Operation for Specializing a Fast Clustering Algorithm for Text Clustering

摘要

著录项

相似文献

相关主题

期刊订阅