【24h】

The limitation of the SVD for latent semantic indexing

机译:SVD对潜在语义索引的局限性

获取原文

摘要

Latent semantic indexing (LSI) is an indexing method for improving retrieval performance of an information retrieval system by grouping related documents to the same clusters so that each of these documents indexes the same (or almost the same) words, and unrelated documents index (relatively) different words. The de facto standard method for LSI is the truncated singular value decomposition (SVD). In this paper, we show that the LSI capability of the truncated SVD is not as conclusive as previously reported; rather it is a conditional aspect and when the condition is not met, then the truncated SVD can fail in recognizing the related documents resulting in a poor retrieval performance.
机译:潜在语义索引(LSI)是一种索引方法,用于通过将相关文档分组到相同的群集来提高信息检索系统的检索性能,使得这些文档中的每一个索引相同(或几乎相同)的单词和不相关的文档索引(相对)不同的单词。 LSI的事实标准方法是截断的奇异值分解(SVD)。在本文中,我们表明,如前所述,截短的SVD的LSI能力不像先前的结论;相反,它是一个条件方面,并且当不满足条件时,截短的SVD可以失败,以识别相关文件,导致检索性能差。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号