Similarity Measure Algorithm for Text Document Clustering, Using Singular Value Decomposition

Valentina Adu; Michael Donkor Adane; Kwadwo Asante

首页> 外文期刊>Current Journal of Applied Science and Technology >Similarity Measure Algorithm for Text Document Clustering, Using Singular Value Decomposition

【24h】

Similarity Measure Algorithm for Text Document Clustering, Using Singular Value Decomposition

机译：相似度测量文本文档聚类的算法，使用奇异值分解

获取原文

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We examined a similarity measure between text documents clustering. Data mining is a challenging field with more research and application areas. Text document clustering, which is a subset of data mining helps groups and organizes a large quantity of unstructured text documents into a small number of meaningful clusters. An algorithm which works better by calculating the degree of closeness of documents using their document matrix was used to query the terms/words in each document. We also determined whether a given set of text documents are similar/different to the other when these terms are queried. We found that, the ability to rank and approximate documents using matrix allows the use of Singular Value Decomposition (SVD) as an enhanced text data mining algorithm. Also, applying SVD to a matrix of a high dimension results in matrix of a lower dimension, to expose the relationships in the original matrix by ordering it from the most variant to the lowest.

机译：我们检查了文本文档聚类之间的相似性度量。数据挖掘是一个具有挑战性的领域，具有更多的研究和应用领域。文本文档群集是数据挖掘的子集帮助组并将大量非结构化文本文档组织成少量有意义的集群。通过计算使用其文档矩阵计算文档的亲密度更好的算法用于查询每个文档中的术语/单词。当查询这些条款时，我们还确定了一组给定的文本文档文件是否与另一组不同/不同。我们发现，使用矩阵排列和近似文档的能力允许使用奇异值分解（SVD）作为增强的文本数据挖掘算法。此外，将SVD应用于高维的矩阵导致较低尺寸的矩阵，通过将其从最大变量排序到最低限制来暴露原始矩阵中的关系。

著录项

来源
《Current Journal of Applied Science and Technology》 |2021年第22期|共18页
作者
Valentina Adu; Michael Donkor Adane; Kwadwo Asante;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类科学、科学研究;
关键词
Data miningsimilarityterm frequencysingular value decompositionclustering;

机译：数据俘获敏捷性分解Clusterm;

相似文献

外文文献
中文文献
专利

1. An Extensive Study of Similarity and Dissimilarity Measures Used for Text Document Clustering using K-means Algorithm [J] . Maedeh Afzali, Suresh Kumar International Journal of Information Technology and Computer Science . 2018,第9期

机译：基于K-means算法的文本文档聚类中相似度和相异度度量的广泛研究
2. An Efficient Technique to Implement Similarity Measures in Text Document Clustering using Artificial Neural Networks Algorithm [J] . K. Selvi, R.M. Suresh Research journal of applied science, engineering and technology . 2014,第23期

机译：利用人工神经网络算法在文本文档聚类中实现相似性度量的有效技术
3. An Efficient Technique to Implement Similarity Measures in Text Document Clustering using Artificial Neural Networks Algorithm [J] . K. Selvi, R.M. Suresh Research journal of applied science, engineering and technology . 2014,第23期

机译：利用人工神经网络算法在文本文档聚类中实现相似度度量的有效技术
4. Comparison of dimensional reduction using the Singular Value Decomposition Algorithm and the Self Organizing Map Algorithm in clustering result of text documents [C] . Muhammad Ihsan Jambak, Ahmad Ikrom Izzuddin Jambak Joint Conference on Green Engineering Technology and Applied Computing . 2019

机译：使用奇异值分解算法的尺寸减少与文本文档聚类结果中的自组织地图算法
5. Dynamic Document Clustering using singular value decomposition. [D] . Ramesh, Rashmi Nadubeedi. 2011

机译：使用奇异值分解的动态文档聚类。
6. Swarm Intelligence Algorithms in Text Document Clustering with Various Benchmarks [O] . Suganya Selvaraj, Eunmi Choi 2021

机译：文本文档集群中的群智能算法与各种基准
7. An Efficient Technique to Implement Similarity Measures in Text Document Clustering using Artificial Neural Networks Algorithm [O] . K. Selvi, R.M. Suresh 2014

机译：使用人工神经网络算法在文本文档聚类中实现相似度测量的有效技术

Similarity Measure Algorithm for Text Document Clustering, Using Singular Value Decomposition

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅