【24h】

Latent Semantic Analysis: An Approach to Understand Semantic of Text

机译:潜在语义分析:一种理解文本语义的方法

获取原文
获取原文并翻译 | 示例

摘要

Latent semantic analysis (LSA) is a method for analyzing a piece of text with certain mathematical computation and analyzing relationship between terms in the documents, between the documents in the corpus.Various application of intelligent information retrieval, search engines, internet news sites requires an accurate method of accessing document similarity in order to carry out classification, clustering, summarizing or search tasks. So in this paper we are studying latent semantic analysis based on single value decomposition. The aim of Latent semantic analysis is to exploit the global structure of documents. The emphasis of latent semantic analysis is to find hidden relationship in document for better understanding the relationship between terms and document in dataset. In this paper, we have conducting a study using Latent semantic analysis (LSA) to find correlation of terms in a dataset consisting of research papers of various natural language processing applications.LSA shows that single value decomposition collapse multiple terms with same semantic and can identify terms with multiple meaning and represent documents in lower dimensional conceptual space.
机译:潜在语义分析(LSA)是一种通过一定的数学计算来分析一段文本并分析文档中术语之间,语料库中文档之间的关系的方法。智能信息检索,搜索引擎,互联网新闻站点的各种应用都需要一种访问文档相似性以执行分类,聚类,汇总或搜索任务的准确方法。因此,本文研究基于单值分解的潜在语义分析。潜在语义分析的目的是利用文档的全局结构。潜在语义分析的重点是在文档中找到隐藏的关系,以更好地理解数据集中术语与文档之间的关系。在本文中,我们进行了一项使用潜在语义分析(LSA)的研究,以在由各种自然语言处理应用程序的研究论文组成的数据集中找到术语的相关性.LSA表明,单值分解会折叠具有相同语义的多个术语并可以识别具有多种含义的术语,并在较低维度的概念空间中表示文档。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号