Dimension Reduction by Word Clustering with Semantic Distance

Abstract

In information retrieval, Latent Semantic Analysis (LSA) is a method for handling large, sparse document vectors. LSA reduces the dimension of document vectors by statistically deriving a set of topics that relate documents and terms. As a result, it needs a certain number of words and takes no account of the semantic relations between words. In this paper, words are clustered using their semantic distances, and the dimension of the document vectors is reduced to the number of word clusters. Word distances can be calculated using WordNet. This method does not depend on the number of words or documents. For especially small documents, we use each word's dictionary definition and calculate the similarities between documents.
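
The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which clustering algorithm or threshold is used, so single-linkage clustering with a fixed threshold is assumed here, and the hand-written `TOY_DISTANCE` table is a hypothetical stand-in for distances that would come from WordNet. All function names are illustrative.

```python
from collections import Counter
from math import sqrt

# Hypothetical pairwise semantic distances (0 = same meaning, 1 = unrelated).
# These values are invented for illustration; the paper derives them from WordNet.
TOY_DISTANCE = {
    ("car", "automobile"): 0.0,
    ("car", "truck"): 0.2,
    ("automobile", "truck"): 0.2,
    ("dog", "cat"): 0.3,
}


def distance(a, b):
    """Look up the toy distance; pairs missing from the table count as unrelated."""
    if a == b:
        return 0.0
    return TOY_DISTANCE.get((a, b), TOY_DISTANCE.get((b, a), 1.0))


def cluster_words(words, threshold):
    """Single-linkage clustering (an assumption): merge two clusters
    whenever any cross pair of words lies within the threshold."""
    clusters = [{w} for w in sorted(set(words))]
    merged = True
    while merged:
        merged = False
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                if any(distance(a, b) <= threshold
                       for a in clusters[i] for b in clusters[j]):
                    clusters[i] |= clusters.pop(j)
                    merged = True
                    break
            if merged:
                break  # restart the scan, since the cluster list changed
    return clusters


def reduce_document(tokens, clusters):
    """Map a token list to a vector with one count per word cluster."""
    counts = Counter(tokens)
    return [sum(counts[w] for w in cluster) for cluster in clusters]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = sqrt(sum(x * x for x in u)) * sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0


docs = {
    "d1": ["car", "truck", "dog"],
    "d2": ["automobile", "cat"],
    "d3": ["cat", "dog", "dog"],
}
vocabulary = [w for tokens in docs.values() for w in tokens]
clusters = cluster_words(vocabulary, threshold=0.3)
vectors = {name: reduce_document(tokens, clusters) for name, tokens in docs.items()}
```

With these toy values, the five vocabulary words collapse into two clusters (vehicles and animals), so each document becomes a two-dimensional vector. Documents `d1` and `d2` share no literal word, yet they end up with a high cosine similarity because their words fall into the same clusters.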

Bibliographic details

Source
IEEE/ACIS International Conference on Big Data, Cloud Computing, Data Science and Engineering | 2020 | xiii, 214 pages | 15 pages in total
Conference venue
Authors
Toshinori Deguchi; Naohiro Ishii
Author affiliations

Conference organizer
Original format: PDF
Text language
Chinese Library Classification: TP311.13-532
Keywords
Dimension Reduction; Word Clustering; Semantic Distance

Similar literature

1. Empirical Analysis of the Effect of Dimension Reduction and Word Order on Semantic Vectors [J]. Laurianne Sitbon, Peter D. Bruza, Christian Prokopp. International Journal of Semantic Computing. 2012, Issue 3
2. Universal Dimensions of Meaning Derived from Semantic Relations among Words and Senses: Mereological Completeness vs. Ontological Generality [J]. Alexei V. Samsonovich, Giorgio A. Ascoli. Computation. 2014, Issue 3
3. Using a high-dimensional graph of semantic space to model relationships among words [J]. Alice F. Jackson, Donald J. Bolger. Frontiers in Psychology. 2014, Issue 4
4. Dimension Reduction by Word Clustering with Semantic Distance [C]. Toshinori Deguchi, Naohiro Ishii. IEEE/ACIS International Conference on Big Data, Cloud Computing, Data Science and Engineering. 2020
5. Multiple alternative clusterings and dimensionality reduction [D]. Niu, Donglin. 2012
6. Using a high-dimensional graph of semantic space to model relationships among words [O]. Alice F. Jackson, Donald J. Bolger. 2014
7. Dimensionality Reduction for Distance Based Video Clustering [O]. Thiagarajan, Jayaraman J.; Ramamurthy, Karthikeyan N.; Spanias, Andreas. 2010
