首页> 外文会议>IEEE Congress on Evolutionary Computation >Using Semantic Similarity Matrix for Defining Operations involved in NTSO for Clustering 20 News Groups

【24h】

Using Semantic Similarity Matrix for Defining Operations involved in NTSO for Clustering 20 News Groups

机译：使用语义相似性矩阵定义群体群集20个新闻组中涉及的操作

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this research, we propose the similarity matrix based version of NTSO as the approach to the text clustering. For using one of traditional approaches to text clustering, documents should be encoded into numerical vectors; encoding so causes the two main problems: the huge dimensionality and the sparse distribution. In order to solve the problems, in this research, we propose to encode documents into string vectors and use the NTSO (Neural Text Self Organization) as the string vector based neural network for the text clustering. By encoding documents into another form, we attempt to avoid the two main problems, completely. As the empirical validation, the proposed approach will be compared with others with respect to the clustering performance and speed.

机译：在这项研究中，我们提出了基于Matrix的NTSO版本作为文本群集的方法。对于使用传统方法之一进行文本聚类，应将文档编码为数字向量;编码所以导致两个主要问题：巨大的维度和稀疏分布。为了解决问题，在本研究中，我们建议将文档编码为串向量，并使用NTSO（神经文本自组织）作为文本群集的基于串向量的神经网络。通过将文件编码为另一种形式，我们试图避免完全避免两个主要问题。作为经验验证，将与其他人相比，将拟议的方法与集群性能和速度进行比较。

著录项

来源
《IEEE Congress on Evolutionary Computation》|2010年||共6页
会议地点
作者
Taeho Jo;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP301.6-53;
关键词

相似文献

外文文献
中文文献
专利

1. Paraphrase identification and semantic text similarity analysis in Arabic news tweets using lexical, syntactic, and semantic features [J] . Mohammad AL-Smadi, Zain Jaradat, Mahmoud AL-Ayyoub, Information Processing & Management . 2017,第3期

机译：使用词汇，句法和语义特征的阿拉伯新闻推文中的释义识别和语义文本相似性分析
2. An Efficient Approach for Ranking of Semantic Web Documents by Computing Semantic Similarity and Using HCS Clustering [J] . Poonam Chahal, Manjeet Singh International journal of signs and semiotic systems . 2021,第1期

机译：通过计算语义相似性和使用HCS群集来进行语义Web文档的高效方法
3. Clustering for semantic purposes: Exploration of semantic similarity in a technical corpus [J] . Ann Bertels, Dirk Speelman Terminology . 2014,第2期

机译：出于语义目的的聚类：技术语料库中语义相似性的探索
4. Using semantic similarity matrix for defining operations involved in NTSO for clustering 20NewsGroups [C] . Jo Taeho IEEE Congress on Evolutionary Computation . 2010

机译：使用语义相似性矩阵定义NTSO中涉及的操作以对20NewsGroups进行聚类
5. Categorizer: a tool to categorize genes into user-defined biological groups based on semantic similarity [O] . Dokyun Na, Hyungbin Son, Jörg Gsponer 2014

机译：分类器：基于语义相似度将基因分类为用户定义的生物组的工具
6. Clustering for Semantic Purposes: Exploration of Semantic Similarity in a Technical Corpus [O] . Bertels Ann, Speelman Dirk 2014

机译：出于语义目的的聚类：技术语料库中语义相似性的探索
7. Comparison of Human and Latent Semantic Analysis (LSA) Judgements of Pairwise Document Similarities for a News Corpus [R] . Pincombe, B. 2004

机译：新闻语料库中两两文档相似度的人类和潜在语义分析（Lsa）判断的比较

Using Semantic Similarity Matrix for Defining Operations involved in NTSO for Clustering 20 News Groups

摘要

著录项

相似文献

相关主题

期刊订阅