Clustering of text documents with keyword weighting function

A. Christy; G. Meera Gandhi; S. Vaithyasubramanian


Abstract

In this digital world, data is abundant everywhere and is growing at a phenomenal rate. Making data readily available for decision making is an important task of the data analyst. In this article, we propose an unsupervised learning algorithm for text document clustering that adopts a keyword weighting function. Documents are pre-processed, and relevant keywords are grouped together based on their weights. Clustered keyword weighting (CKW) takes each class in the training collection as a known cluster and searches for feature weights iteratively to optimise the clustering objective function, in order to retrieve the best clustering result. The performance of CKW is validated by clustering the BBC news text collection. Experiments were conducted with simple K-means and hierarchical clustering algorithms, and our keyword weighting and clustering approach showed improved cluster quality compared to the other methods.
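
The abstract does not spell out the CKW weighting function or its objective, so the following is only a minimal sketch of the baseline it is compared against: keywords are weighted, documents are normalised, and simple K-means groups them. It assumes a smoothed TF-IDF weight and cosine similarity, neither of which is confirmed by the abstract. The stopword list, the function names (`tokenize`, `keyword_weights`, `kmeans`), the seed indices, and the four toy documents are all hypothetical and stand in for the BBC news collection.

```python
import math
from collections import Counter

# Hypothetical, tiny stopword list used only for this illustration.
STOPWORDS = {"the", "a", "an", "of", "in", "on", "and", "to", "for", "with", "as"}


def tokenize(text):
    """Pre-process: lowercase, strip punctuation, drop stopwords."""
    words = (w.strip(".,;:!?()\"'").lower() for w in text.split())
    return [w for w in words if w and w not in STOPWORDS]


def normalise(vec):
    """Scale a sparse {term: weight} vector to unit L2 length."""
    norm = math.sqrt(sum(w * w for w in vec.values()))
    return {t: w / norm for t, w in vec.items()} if norm else dict(vec)


def keyword_weights(docs):
    """Assumed weighting: smoothed TF-IDF, one normalised vector per document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    doc_freq = Counter(t for toks in tokenized for t in set(toks))
    vectors = []
    for toks in tokenized:
        tf = Counter(toks)
        vec = {t: c * (math.log((1 + n_docs) / (1 + doc_freq[t])) + 1.0)
               for t, c in tf.items()}
        vectors.append(normalise(vec))
    return vectors


def cosine(u, v):
    """Dot product of two unit-length sparse vectors."""
    return sum(w * v.get(t, 0.0) for t, w in u.items())


def kmeans(vectors, seeds, max_iter=20):
    """Simple K-means on unit vectors; `seeds` are indices of the initial centroids."""
    centroids = [dict(vectors[i]) for i in seeds]
    labels = None
    for _ in range(max_iter):
        new_labels = [max(range(len(centroids)), key=lambda c: cosine(v, centroids[c]))
                      for v in vectors]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for c in range(len(centroids)):
            total = {}
            for v, label in zip(vectors, labels):
                if label == c:
                    for t, w in v.items():
                        total[t] = total.get(t, 0.0) + w
            if total:  # keep the old centroid if a cluster went empty
                centroids[c] = normalise(total)
    return labels


# Toy documents (hypothetical, not taken from the BBC collection).
docs = [
    "The striker scored a goal in the football match",
    "The football team won the match with a late goal",
    "The bank raised interest rates for the market",
    "Shares fell on the stock market as the bank raised rates",
]
vectors = keyword_weights(docs)
labels = kmeans(vectors, seeds=(0, 2))
print(labels)
```

Fixed seed indices keep the toy run deterministic; a real experiment would use random or k-means++ initialisation and an external quality measure to compare clusterings.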


Bibliographic record

Source
International Journal of Intelligent Enterprise | 2019, Issue 1 | 13 pages
Authors
A. Christy; G. Meera Gandhi; S. Vaithyasubramanian
Author affiliations

Faculty of Computing Sathyabama Institute of Science and Technology;

Faculty of Computing Sathyabama Institute of Science and Technology;

Department of Mathematics Sathyabama Institute of Science and Technology;

Indexing information
Original format: PDF
Language of text: English
Chinese Library Classification: Computing technology, computer technology
Keywords
Documents; Cluster; Unsupervised; Feature; K-means; Normalised

Similar literature

1. Clustering of text documents with keyword weighting function [J]. A. Christy, G. Meera Gandhi, S. Vaithyasubramanian. International Journal of Intelligent Enterprise. 2019, Issue 1
2. An improved algorithm for weighting keywords in web documents [J]. 孙双, 贺樑, 杨静. Journal of Shanghai University (English Edition). 2008, Issue 003
3. A combination of objective functions and hybrid Krill herd algorithm for text document clustering analysis [J]. Abualigah Laith Mohammad, Khader Ahamad Tajudin, Hanandeh Essam Said. Engineering Applications of Artificial Intelligence. 2018, Issue AUGa
4. A Novel Weighting Scheme Applied to Improve the Text Document Clustering Techniques [C]. Laith Mohammad Abualigah, Ahamad Tajudin Khader, Essam Said Hanandeh. International conference on the computer science and engineering. 2018
5. Text document topical recursive clustering and automatic labeling of a hierarchy of document clusters [D]. Li, Xiaoxiao. 2012
6. Swarm Intelligence Algorithms in Text Document Clustering with Various Benchmarks [O]. Suganya Selvaraj, Eunmi Choi. 2021
7. Simultaneous Categorization of Text Documents And Identification of Cluster-dependent Keywords [O]. Hichem Frigui, Olfa Nasraoui. 2002
8. Soft Clustering Criterion Functions for Partitional Document Clustering [R]. Zhao, Y., Karypis, G. 2004
