Fuzzy discrete correlation for document clustering

机译：用于文档聚类的模糊离散相关

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Nowadays, there is an enormous growth in the quantity of text documents on the Internet, digital libraries and news sources. This has led to an increased interest in developing methods that help users to effectively navigate, summarize, and organize this information. A new method that uses neighbor and link concepts has more suitable performance than previous methods in this field. Two documents are neighbors if their similarity is more than a defined threshold. If they are neighbors, neighbor matrix element is set to one, otherwise it is set to zero. So we lose some information about documents similarity in it and therefore decrease of accuracy. To overcome this problem, we propose two methods of “discrete correlation” and “fuzzy correlation”, which both of them attempt to accurate neighbor definition more and more and so reach better clustering results. To evaluate our work, we used k-means algorithm to determine the initial cluster centers and similarity criteria between documents and centers. The results of applying proposed method on real-world document data sets by information retrieval factors show better performance than traditional algorithms and previous works.

机译：如今，Internet，数字图书馆和新闻来源上的文本文档数量有了巨大的增长。这导致人们对开发帮助用户有效导航，汇总和组织此信息的方法的兴趣日益浓厚。使用邻居和链接概念的新方法比该领域中的先前方法具有更合适的性能。如果两个文档的相似度超过定义的阈值，则它们是邻居。如果它们是邻居，则将邻居矩阵元素设置为1，否则将其设置为0。因此，我们会丢失一些有关文档相似性的信息，因此会降低准确性。为了克服这个问题，我们提出了“离散相关”和“模糊相关”两种方法，它们都试图越来越精确地定义邻居，从而获得更好的聚类结果。为了评估我们的工作，我们使用k-means算法来确定初始聚类中心以及文档和中心之间的相似性标准。通过信息检索因素将方法应用于现实世界文档数据集的结果显示，其性能优于传统算法和先前的工作。

著录项

来源
《2011 International Symposium on Artificial Intelligence and Signal Processing》|2011年|p.59-65|共7页
会议地点
作者
Danesh Malihe; Naghibzadeh Mahmoud; Harati Ahad;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词
correlation; discrete; document clustering; fuzzy; link; neighbor; similarity;

机译：相关;离散;文档聚类;模糊;链接;邻居;相似性;

相似文献

外文文献
中文文献
专利

1. Clustering documents with labeled and unlabeled documents using fuzzy semi-Kmeans [J] . Chien-Liang Liu, Tao-Hsing Chang, Hsuan-Hsun Li Fuzzy sets and systems . 2013,第juna16期

机译：使用模糊半均值将文档与带标签和未带标签的文档聚类
2. A modified fuzzy clustering for documents retrieval: Application to document categorization [J] . S. Nefti, Y. Rezgui, M. Oussalah Operations Research . 2010,第1a2期

机译：一种改进的文档检索模糊聚类：在文档分类中的应用
3. Enhancing Document Clustering Using Condensing Cluster Terms and Fuzzy Association [J] . Sun PARK, Seong Ro LEE IEICE transactions on information and systems . 2011,第6期

机译：使用压缩聚类项和模糊关联来增强文档聚类
4. Fuzzy discrete correlation for document clustering [C] . Danesh Malihe, Naghibzadeh Mahmoud, Harati Ahad International Symposium on Artificial Intelligence and Signal Processing . 2011

机译：文档聚类的模糊离散相关性
5. On the Solution of Discrete-Valued Inverse Problems Through Guided Fuzzy c-Means Clustering [D] . Maag-Capriotti, Elizabeth Marie. 2020

机译：通过引导模糊C-MERIAL聚类对离散逆问题的解决方案
6. A New Validity Measure for a Correlation-Based Fuzzy C-means Clustering Algorithm [O] . Mingrui Zhang, Wei Zhang, Hugues Sicotte, -1

机译：基于相关性的模糊C-均值聚类算法的有效性检验
7. Fuzzy clustering of web documents using equivalence relations and fuzzy hierarchical clustering [O] . kumar, Satendra, kathuria, Mamta, Gupta, Alok Kumar, 2014

机译：基于等价关系和模糊数学的Web文档模糊聚类层次聚类

Fuzzy discrete correlation for document clustering

摘要

著录项

相似文献

相关主题

期刊订阅