首页> 外文会议>String Processing and Information Retrieval; Lecture Notes in Computer Science; 4209 >Incremental Aggregation of Latent Semantics Using a Graph-Based Energy Model
【24h】

Incremental Aggregation of Latent Semantics Using a Graph-Based Energy Model

机译:基于图的能量模型的潜在语义增量聚合

获取原文
获取原文并翻译 | 示例

摘要

A graph-theoretic model for incrementally detecting latent associations among terms in a document corpus is presented. The algorithm is based on an energy model that quantifies similarity in context between pairs of terms. Latent associations that are established in turn contribute to the energy of their respective contexts. The proposed model avoids the polysemy problem where spurious associations across terms in different contexts are established due to the presence of one or more common polysemic terms. The algorithm works in an incremental fashion where energy values are adjusted after each document is added to the corpus. This has the advantage that computation is localized around the set of terms contained in the new document, thus making the algorithm run much faster than conventional matrix computations used for singular value decompositions.
机译:提出了一种用于逐步检测文档语料库中各个词之间的潜在关联的图论模型。该算法基于一个能量模型,该模型量化了术语对之间的上下文相似性。反过来建立的潜在关联会为其各自上下文提供能量。所提出的模型避免了由于存在一个或多个公共多义词而在不同上下文中建立了跨词的虚假关联的多义问题。该算法以增量方式工作,其中在将每个文档添加到语料库之后调整能量值。这样做的好处是,计算可以围绕新文档中包含的一组术语进行定位,从而使算法的运行速度比用于奇异值分解的常规矩阵计算快得多。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号