【24h】

Unified Semantic Parsing with Weak Supervision

机译:统一的语义解析弱监督

获取原文
获取外文期刊封面目录资料

摘要

Semantic parsing over multiple knowledge bases enables a parser to exploit structural similarities of programs across the multiple domains. However, the fundamental challenge lies in obtaining high-quality annotations of (utterance, program) pairs across various domains needed for training such models. To overcome this, we propose a novel framework to build a unified multi-domain enabled semantic parser trained only with weak supervision (denotations). Weakly supervised training is particularly arduous as the program search space grows exponentially in a multi-domain setting. To solve this, we incorporate a multi-policy distillation mechanism in which we first train domain-specific semantic parsers (teachers) using weak supervision in the absence of the ground truth programs, followed by training a single unified parser (student) from the domain specific policies obtained from these teachers. The resultant semantic parser is not only compact but also generalizes better, and generates more accurate programs. It further does not require the user to provide a domain label while querying. On the standard OVERNIGHT dataset (containing multiple domains), we demonstrate that the proposed model improves performance by 20% in terms of denotation accuracy in comparison to baseline techniques.
机译:通过多个知识库的语义解析使解析器能够在多个域中利用程序的结构相似之处。然而,基本挑战在于在培训此类模型所需的各个域中获得高质量注释(话语,程序)对。为了克服这一点,我们提出了一种新颖的框架来构建一个统一的多域启用的语义解析器,仅涉及弱监督(表示)。由于程序搜索空间在多域设置中呈指数增长,弱监督培训特别艰巨。为了解决这个问题,我们纳入了一种多政策蒸馏机制,我们首先在没有地面真理计划的情况下使用弱监管培训特定于域的语义解毒(教师),然后通过域名培训一个统一的解析器(学生)从这些教师获得的具体政策。结果语义解析器不仅是紧凑的,而且还更好地概括,并产生更准确的程序。它还不需要用户在查询时提供域标签。在标准过夜数据集(包含多个域),我们证明,与基线技术相比,所提出的模型在表示精度方面提高了20%的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号