Semantic Correlation Network Based Text Clustering

机译：基于语义关联网络的文本聚类

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Text documents have sparse data spaces, and nearest neighbors may belong to different classes when using current existing proximity measures to describe the correlation of documents. In this paper, we propose an asymmetric similarity measure to strengthen the discriminative feature of document objects. We construct a semantic correlation network by asymmetric similarity between documents and conjecture the power law feature of the connections distributions. Hub points which exist in semantic correlation network are classified by an agglomerative hierarchical clustering approach named SCN. Both objects similarity and neighbors similarity are considered in the definition of hub points proximity. Finally, we assign the rest text objects to their nearest hub points. The experimental evaluation on textual data sets demonstrates the validity and efficiency of SCN. The comparison with other clustering algorithms shows the superiority of our approach.

机译：文本文档具有稀疏数据空间，并且当使用当前现有的邻近措施来描述文档的相关性时，最近的邻居可能属于不同的类。在本文中，我们提出了一种不对称的相似度措施，以加强文档对象的鉴别特征。通过文档与猜测连接分布的电力法特征之间的不对称相似性来构建语义相关网络。在语义相关网络中存在的集线点由名为SCN的附名分层聚类方法分类。对象相似性和邻居相似度被认为是在集线器点接近的定义中。最后，我们将REST文本对象分配给最近的集线器点。文本数据集的实验评估展示了SCN的有效性和效率。与其他聚类算法的比较显示了我们方法的优越性。

著录项

来源
《Australian Joint Conference on Artificial Intelligence》|2005年||共10页
会议地点
作者
Shaoxu Song; Chunping Li;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词
Data Mining; Knowledge Discovery;

机译：数据挖掘;知识发现;

相似文献

外文文献
中文文献
专利

1. Semantic expansion using word embedding clustering and convolutional neural network for improving short text classification [J] . Wang Peng, Xu Bo, Xu Jiaming, Neurocomputing . 2016,第JANa22PTaB期

机译：使用词嵌入聚类和卷积神经网络进行语义扩展以改善短文本分类
2. Text Semantic Classification of Long Discourses Based on Neural Networks with Improved Focal Loss [J] . Dan Jiang, Jin He Computational intelligence and neuroscience . 2021,第a期

机译：基于神经网络的神经网络文本语义分类，改善焦损
3. Cross-lingual event-centered news clustering based on elements semantic correlations of different news [J] . Hong Xudong, Yu Zhengtao, Tang Moming, Multimedia Tools and Applications . 2017,第23期

机译：基于不同新闻元素语义相关性的跨语言事件中心新闻聚类
4. Semantic Correlation Network Based Text Clustering [C] . Shaoxu Song, Chunping Li Australian Joint Conference on Artificial Intelligence; 20051205-09; Sydney(AU) . 2005

机译：基于语义相关网络的文本聚类
5. Semantic preserving text representation and its applications in text clustering. [D] . Howard, Michael. 2012

机译：语义保留文本表示及其在文本聚类中的应用。
6. Text Semantic Classification of Long Discourses Based on Neural Networks with Improved Focal Loss [O] . Dan Jiang, Jin He 2021

机译：基于神经网络的神经网络文本语义分类改善焦损
7. Distributionally Extended Network-based Word Sense Disambiguation in Semantic Clustering of Polish Texts [O] . Kędzia Paweł, Piasecki Maciej, Kocoń Jan, 2014

机译：波兰语文本语义聚类中基于分布扩展网络的词义消歧

Semantic Correlation Network Based Text Clustering

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅