【24h】

Fine-Grained Class Label Markup of Search Queries

机译:搜索查询的细分类标签标记

获取原文

摘要

We develop a novel approach to the semantic analysis of short text segments and demonstrate its utility on a large corpus of Web search queries. Extracting meaning from short text segments is difficult as there is little semantic redundancy between terms; hence methods based on shallow semantic analysis may fail to accurately estimate meaning. Furthermore search queries lack explicit syntax often used to determine intent in question answering. In this paper we propose a hybrid model of semantic analysis combining explicit class-label extraction with a latent class PCFG. This class-label correlation (CLC) model admits a robust parallel approximation, allowing it to scale to large amounts of query data. We demonstrate its performance in terms of (1) its predicted label accuracy on polysemous queries and (2) its ability to accurately chunk queries into base constituents.
机译:我们开发了一种新颖的方法来对短文本段进行语义分析,并在大量的Web搜索查询中展示了其实用性。从短文本段中提取含义很困难,因为术语之间几乎没有语义冗余。因此,基于浅层语义分析的方法可能无法准确估计含义。此外,搜索查询缺少通常用于确定问题解答意图的显式语法。在本文中,我们提出了一种语义分析的混合模型,该模型将显式类标签提取与潜在类PCFG相结合。该类标签相关性(CLC)模型允许使用鲁棒的并行近似,从而可以扩展到大量查询数据。我们通过(1)多义查询的预测标签准确性和(2)将查询准确地分块为基本成分的能力来证明其性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号