Use of the EPSILON Decomposition and the SVD Based LSI Techniques for Reduction of the Large Indexing Structures

机译：使用epsilon分解和基于SVD的LSI技术来减少大型分度结构

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Storage of indexing structures in the Vector Space Model (VSM) form has a number of advantages. In the case when text documents are considered, the indexing structure states the Term-By-Document (TBD) matrix. Its size is proportional to the product of the indexed documents number and the keywords number. In the case of large text documents databases, the size of the indexing structure is a serious limitation. Too large TBD matrix may not be able to be stored in memory or the process of searching for documents may take too much time. The article presents a methodology that allows to reduce the size of the large TBD matrix. The operation performed on the TBD matrix is the Singular Value Decomposition (SVD). It allows to transform the original indexing structure vectors into a space with fewer dimensions. As a result of the operation, keywords used in the indexing process are generalized. This is a desirable effect, methods for generalizing the keywords are called the Latent Sematic Indexing (LSI) methods. Despite the undeniable advantages of the SVD decomposition, it has a big disadvantage. Its computational complexity is O(n~3). In practice, this prevents the application of the method to a large indexing structure. The methodology presented in the article assumes the use of the Epsilon decomposition in order to divide the original TBD matrix into parts before the reduction process. The proposed modification allows the use of the SVD decomposition for the indexing structure of any size.

机译：在向量空间模型（VSM）形式中存储索引结构具有许多优点。在考虑文本文档时，索引结构排列了逐个文档（TBD）矩阵。其大小与索引文档编号和关键字编号的乘积成比例。在大文本文档数据库的情况下，索引结构的大小是严重的限制。 TBD矩阵太大可能无法存储在存储器中，或者搜索文档的过程可能需要太多时间。该物品呈现了一种方法，允许减小大TBD矩阵的大小。在TBD矩阵上执行的操作是奇异值分解（SVD）。它允许将原始索引结构向量转换为具有较少维度的空间。作为操作的结果，索引过程中使用的关键字是概括的。这是一个理想的效果，用于概括关键字的方法称为潜在语义索引（LSI）方法。尽管SVD分解的不可否认的优势，但它具有很大的缺点。其计算复杂性是O（n〜3）。在实践中，这可以防止该方法的应用到大型索引结构。在文章中呈现的方法假设使用ε分解，以便将原始TBD矩阵分成还原过程之前的部分。所提出的修改允许使用SVD分解进行任何尺寸的索引结构。

著录项

来源
《International Conference on Information Systems Architecture and Technology》|2019年|xvi 384 pages :|共12页
会议地点
作者
Damian Raczyński; W?odzimierz Stanislawski;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-532;
关键词
Epsilon decomposition; Dimensional reduction; Application of the SVD decomposition; Latent Semantic Indexing LSI; Large indexing structures reduction; Retrieval systems algorithms; Retrieval systems design;

机译：epsilon分解;尺寸减少;SVD分解的应用;潜在语义索引LSI;大型索引结构减少;检索系统算法;检索系统设计;

相似文献

外文文献
中文文献
专利

1. Electrocardiogram signal compression based on singular value decomposition (SVD) and adaptive scanning wavelet difference reduction (ASWDR) technique [J] . Kumar Ranjeet, Kumar A., Singh G. K. AEU: Archiv fur Elektronik und Ubertragungstechnik: Electronic and Communication . 2015,第12期

机译：基于奇异值分解（SVD）和自适应扫描小波差异减少（ASWDR）技术的心电图信号压缩
2. Empirical mode decomposition based filtering techniques for power line interference reduction in electrocardiogram using various adaptive structures and subtraction methods [J] . M. Suchetha, N. Kumaravel Biomedical signal processing and control . 2013,第6期

机译：基于经验模式分解的滤波技术，可通过各种自适应结构和减法来减少心电图中的电源线干扰
3. Effect of wavelet based image fusion techniques with principal component analysis (PCA) and singular value decomposition (SVD) in supervised classification [J] . Sulochana S., Vidhya R., Mohanraj K., Indian Journal of Marine Sciences . 2017,第2期

机译：基于小波的图像融合技术与主成分分析（PCA）和奇异值分解（SVD）在监督分类中的作用
4. Use of the EPSILON Decomposition and the SVD Based LSI Techniques for Reduction of the Large Indexing Structures [C] . Damian Raczyński, W?odzimierz Stanislawski International Conference on Information Systems Architecture and Technology . 2019

机译：使用epsilon分解和基于SVD的LSI技术来减少大型分度结构
5. Early stage analysis of microarray data using ICA and SVD matrix decomposition techniques. [D] . Survery, Burhan ur Rehman Khan. 2004

机译：使用ICA和SVD矩阵分解技术对微阵列数据进行早期分析。
6. SVD-Based Technique for Interference Cancellation and Noise Reduction in NMR Measurement of Time-Dependent Magnetic Fields [O] . Wenjun Chen, Hong Ma, De Yu, 2016

机译：基于SVD的时变磁场NMR测量中的干扰消除和降噪技术
7. SVD-Based Technique for Interference Cancellation and Noise Reduction in NMR Measurement of Time-Dependent Magnetic Fields [O] . Wenjun Chen, Hong Ma, De Yu, 2016

机译：基于sVD的时间依赖磁场核磁共振测量中的干扰消除和降噪技术
8. Matrices with low-rank-plus-shift structure: Partial SVD and latent semantic indexing [R] . Zha, H. , Zhang, Z. 1998

机译：具有低秩加移位结构的矩阵：部分sVD和潜在语义索引

Use of the EPSILON Decomposition and the SVD Based LSI Techniques for Reduction of the Large Indexing Structures

摘要

著录项

相似文献

相关主题

期刊订阅