Capturing the semantic structure of documents using summaries in Supplemented Latent Semantic Analysis

KARTHIK KRISHNAMURTHI; VIJAYAPAL REDDY PANUGANTI GRIET; VISHNU VARDHAN BULUSU

首页> 外文期刊>WSEAS Transactions on Computers >Capturing the semantic structure of documents using summaries in Supplemented Latent Semantic Analysis

【24h】

Capturing the semantic structure of documents using summaries in Supplemented Latent Semantic Analysis

机译：使用补充潜在语义分析中的摘要捕获文档的语义结构

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Latent Semantic Analysis (LSA) is a mathematical technique that is used to capture the semantic structure of documents based on correlations among textual elements within them. Summaries of documents contain words that actually contribute towards the concepts of documents. In the present work, summaries are used in LSA along with supplementary information such as document category and domain information in the model. This modification is referred as Supplemented Latent Semantic Analysis (SLSA) in this paper. SLSA is used to capture the semantic structure of documents using summaries of various proportions instead of entire full-length documents. The performance of SLSA on summaries is empirically evaluated in a document classification application by comparing the accuracies of classification against plain LSA on full-length documents. It is empirically shown that instead of using full-length documents, their summaries can be used to capture the semantic structure of documents.

机译：潜在语义分析（LSA）是一种数学技术，用于根据文档中文本元素之间的相关性来捕获文档的语义结构。文档摘要中包含实际上有助于文档概念的词语。在当前工作中，摘要在LSA中与模型中的文档类别和域信息等补充信息一起使用。该修改在本文中称为“补充潜在语义分析（SLSA）”。 SLSA用于使用各种比例的摘要而不是整个全长文档来捕获文档的语义结构。在文档分类应用程序中，通过将分类准确性与在全长文档中的普通LSA进行比较，经验地评估了SLSA在摘要上的性能。从经验上可以看出，代替使用全长文档，可以使用摘要来捕获文档的语义结构。

著录项

来源
《WSEAS Transactions on Computers 》 |2015年第null期| 共10页
作者
KARTHIK KRISHNAMURTHI; VIJAYAPAL REDDY PANUGANTI GRIET; VISHNU VARDHAN BULUSU;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术 ;
关键词
Dimensionality Reduction; Document Classification; Latent Semantic Analysis; Semantic Structure; Singular Value Decomposition;

机译：降维;文档分类;潜在语义分析;语义结构;奇异值分解;

相似文献

外文文献
中文文献
专利

1. Capturing the semantic structure of documents using summaries in Supplemented Latent Semantic Analysis [J] . KARTHIK KRISHNAMURTHI, VIJAYAPAL REDDY PANUGANTI GRIET, VISHNU VARDHAN BULUSU WSEAS Transactions on Computers . 2015 ,第Null期

机译：使用补充潜在语义分析中的摘要捕获文档的语义结构
2. COMPARISON OF LATENT SEMANTIC ANALYSIS AND PROBABILISTIC LATENT SEMANTIC ANALYSIS FOR DOCUMENTS CLUSTERING [J] . Marcin Kuta, Jacek Kitowski Computing and informatics . 2014 ,第3期

机译：文档聚类的潜在语义分析和概率潜在语义分析的比较
3. Including category information as supplements in latent semantic analysis of Hindi documents [J] . Karthik Krishnamurthi, Vijayapal Reddy Panuganti, Vishnu Vardhan Bulusu International Journal of Computational Science and Engineering . 2017 ,第1a2期

机译：包括类别信息作为印地文文件潜在语义分析的补充
4. An Effective Approach of Extracting Local Documents from the Distributed Representation of Text using Document Embedding and Latent Semantic Analysis [C] . Vikas Chib, Ahsan Jafri International Conference on Smart Systems and Inventive Technology . 2019

机译：利用文档嵌入和潜在语义分析从文本的分布式表示中提取本地文档的有效方法
5. Generalized latent semantic analysis for document representation [D] . Matveeva, Irina 2008

机译：用于文档表示的广义潜在语义分析
6. Latent Semantic Indexing of medical diagnoses using UMLS semantic structures. [O] . C. G. Chute, Y. Yang, D. A. Evans 1991

机译：使用UMLS语义结构对医学诊断进行潜在语义索引。
7. Development of a computer system for generating semantic template of a group of documents by using latent semantic analysis [O] . Yuriy Taranenko, Maryna Kabanova 2016

机译：开发用于通过使用潜在语义分析生成一组文档的语义模板的计算机系统
8. Comparison of Human and Latent Semantic Analysis (LSA) Judgements of Pairwise Document Similarities for a News Corpus [R] . Pincombe, B. 2004

机译：新闻语料库中两两文档相似度的人类和潜在语义分析（Lsa）判断的比较

Capturing the semantic structure of documents using summaries in Supplemented Latent Semantic Analysis

摘要

著录项

相似文献

相关主题

期刊订阅