Improved fast partitional clustering algorithm for text clustering

Bejos Sebastian; Feliciano-Avelino Ivan; Martinez-Trinidad J. Fco.; Carrasco-Ochoa J. A.

Abstract

Document clustering has become an important task for processing the large amount of textual information available on the Internet. k-means is the most widely used clustering algorithm, mainly because of its simplicity and effectiveness. However, k-means becomes slow on large, high-dimensional datasets such as document collections. The FPAC algorithm was recently proposed to mitigate this problem, but its speed improvement came at the cost of lower-quality clustering results. For this reason, in this paper we introduce an improved FPAC algorithm which, according to our experiments on different document collections, obtains better clustering results than FPAC without greatly increasing the runtime.
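
To make the cost argument in the abstract concrete, here is a minimal sketch of plain k-means (Lloyd's iteration) on toy document vectors. It is not FPAC or the improved algorithm from the paper, whose details the abstract does not give. The function name `kmeans`, the toy term-weight vectors in `docs`, and the choice of the first k points as initial centroids are all assumptions made for illustration. The assignment step is the part that scales with the number of documents n, the number of clusters k, and the vocabulary size d.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, max_iterations=20):
    """Plain Lloyd-style k-means (illustrative only, not FPAC).

    Assumption: the first k points serve as initial centroids, which keeps
    the example deterministic; practical code usually seeds randomly.
    Returns (centroids, labels).
    """
    centroids = [list(p) for p in points[:k]]
    labels = None
    for _ in range(max_iterations):
        # Assignment step: n * k distance computations, each over d terms.
        # This is the step that gets expensive for large document sets.
        new_labels = [
            min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so the run has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids, labels


# Hypothetical term-weight vectors for six short documents on two topics.
docs = [
    [1.0, 0.9, 0.0, 0.0],  # topic A
    [0.0, 0.1, 1.0, 0.9],  # topic B
    [0.9, 1.0, 0.1, 0.0],  # topic A
    [0.0, 0.0, 0.9, 1.0],  # topic B
    [1.0, 1.0, 0.0, 0.1],  # topic A
    [0.1, 0.0, 1.0, 1.0],  # topic B
]
centroids, labels = kmeans(docs, k=2)
print(labels)  # [0, 1, 0, 1, 0, 1]
```

Real document collections have vocabularies of tens of thousands of terms, so every pass over the assignment step touches far more data than this toy case, which is the slowdown the abstract refers to.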

Bibliographic details

Source
Journal of intelligent & fuzzy systems: Applications in Engineering and Technology, 2020, Issue 2, 9 pages
Authors
Bejos Sebastian; Feliciano-Avelino Ivan; Martinez-Trinidad J. Fco.; Carrasco-Ochoa J. A.
Author affiliations
Inst Nacl Astrofis Opt & Electr, Puebla, Mexico (listed for each of the four authors)
Indexing information
Original format: PDF
Language: English
Chinese Library Classification: Automation systems
Keywords
Document clustering; large collection; high dimensionality

