首页> 外文会议>CIKM 10;ACM conference on information and knowledge management >Hypergraph-based Multilevel Matrix Approximation for Text Information Retrieval
【24h】

Hypergraph-based Multilevel Matrix Approximation for Text Information Retrieval

机译:基于超图的文本信息检索多级矩阵逼近

获取原文

摘要

In Latent Semantic Indexing (LSI), a collection of documents is often pre-processed to form a sparse term-document matrix, followed by a computation of a low-rank approximation to the data matrix. A multilevel framework based on hypergraph coarsening is presented which exploits the hypergraph that is canonically associated with the sparse term-document matrix representing the data. The main goal is to reduce the cost of the matrix approximation without sacrificing accuracy. Because coarsening by multilevel hypergraph techniques is a form of clustering, the proposed approach can be regarded as a hybrid of factorization-based LSI and clustering-based LSI. Experimental results indicate that our method achieves good improvement of the retrieval performance at a reduced cost.
机译:在潜在语义索引(LSI)中,通常对文档集合进行预处理以形成稀疏的术语文档矩阵,然后对数据矩阵进行低秩近似计算。提出了一种基于超图粗糙化的多级框架,该框架利用与代表数据的稀疏术语文档矩阵规范关联的超图。主要目标是在不牺牲精度的情况下降低矩阵近似的成本。由于通过多级超图技术进行粗化是集群的一种形式,因此该方法可被视为基于因子分解的LSI和基于集群的LSI的混合。实验结果表明,我们的方法以较低的成本实现了检索性能的良好提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号