Application Research on Latent Semantic Analysis for Information Retrieval

Abstract

Classic information retrieval models rest on machine matching of keywords, that is, keyword-based retrieval. This paper proposes a pre-clustering-based latent semantic analysis algorithm for document retrieval. The algorithm addresses the most time-consuming step of traditional latent semantic retrieval: computing the similarity between the query vector and every document vector. It first clusters the documents with k-means in the latent semantic space and finds the center of each cluster, then computes the similarity between the query vector and each cluster center for retrieval. In view of the characteristics of document retrieval, the paper also proposes a new method for calculating feature weights and uses pre-clustering to preprocess the document collection. Experimental results show that the new algorithm reduces search time and improves retrieval efficiency.
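The pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: the abstract does not give the proposed feature-weighting scheme, so raw term counts stand in for it, and the toy corpus, the function names, and the choice to rank only the documents of the best-matching cluster are all hypothetical.

```python
import numpy as np

def build_term_doc_matrix(docs):
    """Term-document count matrix (raw counts are an assumption, not the paper's weights)."""
    vocab = sorted({w for d in docs for w in d.lower().split()})
    index = {w: i for i, w in enumerate(vocab)}
    A = np.zeros((len(vocab), len(docs)))
    for j, d in enumerate(docs):
        for w in d.lower().split():
            A[index[w], j] += 1.0
    return A, index

def lsa(A, k):
    """Truncated SVD: keep the top-k left singular vectors and project documents onto them."""
    U, _, _ = np.linalg.svd(A, full_matrices=False)
    U_k = U[:, :k]
    return U_k, A.T @ U_k  # one k-dimensional row per document

def cosine(a, B):
    """Cosine similarity between vector a and every row of B."""
    B = np.atleast_2d(B)
    denom = np.linalg.norm(a) * np.linalg.norm(B, axis=1)
    return (B @ a) / np.where(denom == 0, 1.0, denom)

def assign(X, centers):
    """Label each row of X with the index of its nearest center."""
    dists = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return dists.argmin(axis=1)

def kmeans(X, n_clusters, n_iter=50, seed=0):
    """Plain Lloyd's k-means; returns cluster centers and labels."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=n_clusters, replace=False)].copy()
    for _ in range(n_iter):
        labels = assign(X, centers)
        new_centers = centers.copy()
        for c in range(n_clusters):
            members = X[labels == c]
            if len(members):  # keep the old center if a cluster empties
                new_centers[c] = members.mean(axis=0)
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return centers, assign(X, centers)

def search(query, index, U_k, doc_vecs, centers, labels, top_n=3):
    """Compare the query with cluster centers first, then rank that cluster's documents."""
    q = np.zeros(U_k.shape[0])
    for w in query.lower().split():
        if w in index:
            q[index[w]] += 1.0
    q_k = q @ U_k  # project the query the same way as the documents
    sizes = np.bincount(labels, minlength=len(centers))
    center_sims = np.where(sizes > 0, cosine(q_k, centers), -np.inf)
    best = int(np.argmax(center_sims))
    members = np.flatnonzero(labels == best)
    sims = cosine(q_k, doc_vecs[members])
    order = np.argsort(-sims)[:top_n]
    return [(int(members[i]), float(sims[i])) for i in order]

docs = [
    "latent semantic analysis for document retrieval",
    "singular value decomposition of the term document matrix",
    "semantic retrieval with latent topics",
    "k-means clustering groups similar points",
    "cluster centroids summarize groups of points",
    "clustering points around centroids",
]
A, index = build_term_doc_matrix(docs)
U_k, doc_vecs = lsa(A, k=2)
centers, labels = kmeans(doc_vecs, n_clusters=2)
for doc_id, score in search("latent semantic retrieval", index, U_k, doc_vecs, centers, labels):
    print(doc_id, round(score, 3), docs[doc_id])
```

The saving comes from the query being compared with a handful of cluster centers plus one cluster's members, rather than with every document vector. The trade-off in this sketch is that relevant documents assigned to other clusters are never scored.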

Bibliographic details

Source: International Conference on Measuring Technology and Mechatronics Automation, 2016, 4 pages
Author: Chen Wenli
Original format: PDF
Chinese Library Classification: TP216
Keywords: Document Retrieval; Latent Semantic Analysis; Singular Value Decomposition; k-means

Similar literature

1. Analysis of Vector Space Model, Latent Semantic Indexing and Formal Concept Analysis for Information Retrieval [J]. Cybernetics and information technologies: CIT. 2012, Issue 1
2. Prior-based probabilistic latent semantic analysis for multimedia retrieval [J]. Fernandez-Beltran Ruben, Pla Filiberto. Multimedia Tools and Applications. 2018, Issue 13
3. Analysis on the use of Latent Semantic Indexing (LSI) for document classification and retrieval system of PNP files [J]. Angelica M. Aquino, Enrico P. Chavez. MATEC Web of Conferences. 2018, Issue 3
4. Application Research on Latent Semantic Analysis for Information Retrieval [C]. Chen Wenli. International Conference on Measuring Technology and Mechatronics Automation. 2016
5. Latent semantic analysis as a method of content-based image retrieval in medical applications [D]. Makovoz, Gennadiy. 2010
6. Application of latent semantic analysis for open-ended responses in a large epidemiologic study [O]. Travis D Leleu, Isabel G Jacobson, Cynthia A LeardMann. 2011
7. Analysis on the use of Latent Semantic Indexing (LSI) for document classification and retrieval system of PNP files [O]. Angelica M. Aquino, Enrico P. Chavez. 2018
