【24h】

Fast Computation of Lexical Affinity Models

机译:词汇亲和模型的快速计算

获取原文
获取原文并翻译 | 示例

摘要

We present a framework for the fast computation of lexical affinity models. The framework is composed of a novel algorithm to efficiently compute the co-occurrence distribution between pairs of terms, an independence model, and a parametric affinity model. In comparison with previous models, which either use arbitrary windows to compute similarity between words or use lexical affinity to create sequential models, in this paper we focus on models intended to capture the co-occurrence patterns of any pair of words or phrases at any distance in the corpus. The framework is flexible, allowing fast adaptation to applications and it is scalable. We apply it in combination with a terabyte corpus to answer natural language tests, achieving encouraging results.
机译:我们提出了一种快速计算词汇相似性模型的框架。该框架由一种新颖的算法组成,可以有效地计算词语对之间的共现分布,独立性模型和参数相似性模型。与以前的模型相比,以前的模型要么使用任意窗口来计算单词之间的相似度,要么使用词法相似性来创建顺序模型,在本文中,我们将重点放在旨在捕获任意距离的任何单词或短语对的共现模式的模型上。在语料库中。该框架非常灵活,可以快速适应应用程序,并且具有可扩展性。我们将其与TB语料库结合使用来回答自然语言测试,取得令人鼓舞的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号