Application Research on Latent Semantic Analysis for Information Retrieval

Abstract

Classic information retrieval models work by machine matching of keywords, that is, retrieval is keyword-based. This paper proposes a pre-clustering-based latent semantic analysis (LSA) algorithm for document retrieval. The algorithm addresses a cost of traditional LSA retrieval: computing the similarity between the query vector and every document vector is time-consuming. It first clusters the documents with k-means on the basis of latent semantic analysis, finds the center of each cluster, and then computes the similarity between the query vector and each cluster center for retrieval. In view of the characteristics of document retrieval, the paper also proposes a new method for calculating feature weights and uses pre-clustering to preprocess the document collection. Experimental results show that the new algorithm reduces search time and improves retrieval efficiency.
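
The pipeline described in the abstract (LSA, k-means pre-clustering, then query-to-center comparison) can be illustrated with a minimal NumPy sketch. It is not the paper's implementation: the toy term-document matrix, the raw-count weighting (the paper's own feature-weight method is not given here), the farthest-first initialization, the function names, and the final step of ranking documents inside the best-matching cluster are all assumptions made for illustration.

```python
import numpy as np

def normalize(X):
    """Scale vectors to unit length along the last axis (zero vectors stay zero)."""
    norms = np.linalg.norm(X, axis=-1, keepdims=True)
    return X / np.where(norms == 0, 1.0, norms)

def lsa_fit(term_doc, rank):
    """Truncated SVD of a terms x documents matrix; returns the term basis
    U_k and one latent vector per document (rows of term_doc.T @ U_k)."""
    U, _, _ = np.linalg.svd(term_doc, full_matrices=False)
    U_k = U[:, :rank]
    return U_k, term_doc.T @ U_k

def assign(X, centroids):
    """Index of the nearest centroid (Euclidean) for every row of X."""
    dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    return dists.argmin(axis=1)

def kmeans(X, k, iters=50):
    """Plain Lloyd iterations with a deterministic farthest-first start
    (the initialization is an assumption, chosen for reproducibility)."""
    chosen = [X[0]]
    for _ in range(1, k):
        d = np.min([np.linalg.norm(X - c, axis=1) for c in chosen], axis=0)
        chosen.append(X[int(np.argmax(d))])
    centroids = np.array(chosen)
    for _ in range(iters):
        labels = assign(X, centroids)
        updated = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
        if np.allclose(updated, centroids):
            break
        centroids = updated
    return centroids, assign(X, centroids)

def search(query_counts, U_k, doc_vecs, centroids, labels, top_n=3):
    """Compare the query with the k cluster centers only, then rank the
    documents of the best cluster by cosine similarity (assumed last step)."""
    q = normalize(query_counts @ U_k)            # fold the query into latent space
    best = int(np.argmax(normalize(centroids) @ q))
    members = np.flatnonzero(labels == best)
    scores = normalize(doc_vecs[members]) @ q
    order = np.argsort(-scores)[:top_n]
    return best, [(int(members[i]), float(scores[i])) for i in order]

# Hypothetical toy corpus: rows are terms, columns are documents.
term_doc = np.array([
    [3, 1, 1, 0, 0, 0],  # retrieval
    [1, 2, 0, 0, 0, 0],  # query
    [0, 1, 2, 0, 0, 0],  # index
    [0, 0, 0, 2, 1, 1],  # cluster
    [0, 0, 0, 1, 2, 0],  # kmeans
    [0, 0, 0, 0, 1, 2],  # centroid
], dtype=float)

U_k, doc_vecs = lsa_fit(term_doc, rank=2)
doc_vecs = normalize(doc_vecs)                   # cluster on unit vectors
centroids, labels = kmeans(doc_vecs, k=2)

query = np.array([1, 1, 0, 0, 0, 0], dtype=float)  # "retrieval query"
best, hits = search(query, U_k, doc_vecs, centroids, labels)
print("best cluster:", best, "hits:", hits)
```

The saving the abstract refers to comes from the first step of `search`: the query is compared with k cluster centers instead of with every document, and only the members of the selected cluster are scored afterwards.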

Bibliographic details

Source
International Conference on Measuring Technology and Mechatronics Automation | 2016 | pp. 118-121 | 4 pages
Author
Chen Wenli
Original format
PDF
Keywords
Document Retrieval; Latent Semantic Analysis; Singular Value Decomposition; k-means

Similar literature

1. Analysis of Vector Space Model, Latent Semantic Indexing and Formal Concept Analysis for Information Retrieval [J]. Cybernetics and Information Technologies: CIT. 2012, No. 1
2. Prior-based probabilistic latent semantic analysis for multimedia retrieval [J]. Fernandez-Beltran Ruben, Pla Filiberto. Multimedia Tools and Applications. 2018, No. 13
3. Analysis on the use of Latent Semantic Indexing (LSI) for document classification and retrieval system of PNP files [J]. Angelica M. Aquino, Enrico P. Chavez. MATEC Web of Conferences. 2018, No. 3
4. Application Research on Latent Semantic Analysis for Information Retrieval [C]. Chen Wenli. International Conference on Measuring Technology and Mechatronics Automation. 2016
5. Latent semantic analysis as a method of content-based image retrieval in medical applications [D]. Makovoz, Gennadiy. 2010
6. Application of latent semantic analysis for open-ended responses in a large epidemiologic study [O]. Travis D Leleu, Isabel G Jacobson, Cynthia A LeardMann. 2011
7. Analysis on the use of Latent Semantic Indexing (LSI) for document classification and retrieval system of PNP files [O]. Angelica M. Aquino, Enrico P. Chavez. 2018
