首页> 外文会议>International Conference on String Processing and Information Retrieval >Incremental Aggregation of Latent Semantics Using a Graph-Based Energy Model
【24h】

Incremental Aggregation of Latent Semantics Using a Graph-Based Energy Model

机译:基于图形的能量模型的潜在语义的增量聚合

获取原文

摘要

A graph-theoretic model for incrementally detecting latent associations among terms in a document corpus is presented. The algorithm is based on an energy model that quantifies similarity in context between pairs of terms. Latent associations that are established in turn contribute to the energy of their respective contexts. The proposed model avoids the polysemy problem where spurious associations across terms in different contexts are established due to the presence of one or more common polysemic terms. The algorithm works in an incremental fashion where energy values are adjusted after each document is added to the corpus. This has the advantage that computation is localized around the set of terms contained in the new document, thus making the algorithm run much faster than conventional matrix computations used for singular value decompositions.
机译:提出了一种图形 - 理论模型,用于递增检测文档语料库中的术语之间的潜在关联。该算法基于能量模型,该能量模型在术语对之间量化上下文中的相似度。反向建立的潜在协会有助于其各自背景的能量。所提出的模型避免了多义问题,其中由于存在一种或多种常见的多种多态性术语,建立了不同上下文中的虚假关联。该算法以增量方式工作,其中在将每个文档添加到语料库后调整能量值。这具有以下优点,即计算在新文档中包含的一组术语周围本地化,从而使算法比用于奇异值分解的传统矩阵计算快得多。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号