首页> 外文会议>Australian Joint Conference on Artificial Intelligence >Semantic Correlation Network Based Text Clustering
【24h】

Semantic Correlation Network Based Text Clustering

机译:基于语义关联网络的文本聚类

获取原文

摘要

Text documents have sparse data spaces, and nearest neighbors may belong to different classes when using current existing proximity measures to describe the correlation of documents. In this paper, we propose an asymmetric similarity measure to strengthen the discriminative feature of document objects. We construct a semantic correlation network by asymmetric similarity between documents and conjecture the power law feature of the connections distributions. Hub points which exist in semantic correlation network are classified by an agglomerative hierarchical clustering approach named SCN. Both objects similarity and neighbors similarity are considered in the definition of hub points proximity. Finally, we assign the rest text objects to their nearest hub points. The experimental evaluation on textual data sets demonstrates the validity and efficiency of SCN. The comparison with other clustering algorithms shows the superiority of our approach.
机译:文本文档具有稀疏数据空间,并且当使用当前现有的邻近措施来描述文档的相关性时,最近的邻居可能属于不同的类。在本文中,我们提出了一种不对称的相似度措施,以加强文档对象的鉴别特征。通过文档与猜测连接分布的电力法特征之间的不对称相似性来构建语义相关网络。在语义相关网络中存在的集线点由名为SCN的附名分层聚类方法分类。对象相似性和邻居相似度被认为是在集线器点接近的定义中。最后,我们将REST文本对象分配给最近的集线器点。文本数据集的实验评估展示了SCN的有效性和效率。与其他聚类算法的比较显示了我们方法的优越性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号