首页> 外文会议>International joint conference on artificial intelligence;IJCAI-11 >An Efficient Framework for Constructing Generalized Locally-Induced Text Metrics
【24h】

An Efficient Framework for Constructing Generalized Locally-Induced Text Metrics

机译:构建广义本地诱导文本度量的有效框架

获取原文

摘要

In this paper, we propose a new framework for constructing text metrics which can be used to compare and support inferences among terms and sets of terms. Our metric is derived from data-driven kernels on graphs that let us capture global relations among terms and sets of terms, regardless of their complexity and size. To compute the metric efficiently for any two subsets of terms, we develop an approximation technique that relies on the precompiled term-term similarities. To scale-up the approach to problems with huge number of terms, we develop and experiment with a solution that sub-samples the term space. We demonstrate the benefits of the whole framework on two text inference tasks: prediction of terms in the article from its abstract and query expansion in information retrieval.
机译:在本文中,我们提出了一个用于构造文本度量的新框架,该框架可用于比较和支持术语和术语集之间的推论。我们的指标源自图上数据驱动的内核,这些内核使我们能够捕获术语和术语集之间的全局关系,而不论它们的复杂性和大小如何。为了有效地计算任意两个子集的度量,我们开发了一种近似技术,该技术依赖于预编译的术语-术语相似度。为了扩大解决包含大量术语的问题的方法,我们开发并尝试了对术语空间进行二次采样的解决方案。我们在两个文本推理任务上展示了整个框架的好处:从摘要中预测文章中的术语以及在信息检索中进行查询扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号