【24h】

A Newton's Universal Gravitation Inspired Firefly Algorithm for Document Clustering

机译:牛顿万有引力启发萤火虫算法的文档聚类

获取原文

摘要

The divisive clustering has the advantage to build a hierarchical structure that is more efficient to represent documents in search engines. Its operation employs one of the partition clustering algorithms that leads to being trapped in a local optima. This paper proposes a Firefly algorithm that is based on Newton's law of universal gravitation, known as Gravitation Firefly Algorithm (GFA), for document clustering. GFA is used to find centers of clusters based on objective function that maximizes the force between each document and an initial center. Upon identification of a center, the algorithm then locates documents that are similar to the center using cosine similarity function. The process of finding centers for new clusters continues by sorting the light intensity values of the balance documents. Experimental results on Reuters datasets showed that the proposed Newton inspired Firefly algorithm is suitable to be used for document clustering in text mining.
机译:分开式聚类的优点是可以构建层次结构,该层次结构可以更有效地表示搜索引擎中的文档。其操作采用分区聚类算法之一,该算法导致陷入局部最优状态。本文提出了一种基于牛顿万有引力定律的萤火虫算法,称为重力萤火虫算法(GFA),用于文档聚类。 GFA用于根据目标函数查找聚类中心,该目标函数可最大化每个文档与初始中心之间的作用力。在识别中心之后,该算法然后使用余弦相似度函数来查找与该中心相似的文档。通过对平衡文件的光强度值进行排序,继续寻找新群集的中心。在路透数据集上的实验结果表明,提出的牛顿启发式Firefly算法适合用于文本挖掘中的文档聚类。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号