首页> 外文会议>AAAI Conference on Artificial Intelligence >Hierarchical Encoder with Auxiliary Supervision for Neural Table-to-Text Generation: Learning Better Representation for Tables
【24h】

Hierarchical Encoder with Auxiliary Supervision for Neural Table-to-Text Generation: Learning Better Representation for Tables

机译:具有辅助监控的分层编码器,用于神经表到文本生成:学习更好的表格表示表格

获取原文

摘要

Generating natural language descriptions for the structured tables which consist of multiple attribute-value tuples is a convenient way to help people to understand the tables. Most neural table-to-text models are based on the encoder-decoder framework. However, it is hard for a vanilla encoder to learn the accurate semantic representation of a complex table. The challenges are two-fold: firstly, the table-to-text datasets of-ten contain large number of attributes across different domains, thus it is hard for the encoder to incorporate these heterogeneous resources. Secondly, the single encoder also has difficulties in modeling the complex attribute-value structure of the tables. To this end, we first propose a two-level hierarchical encoder with coarse-to-fine attention to handle the attribute-value structure of the tables. Furthermore, to capture the accurate semantic representations of the tables, we propose 3 joint tasks apart from the prime encoder-decoder learning, namely auxiliary sequence labeling task, text autoencoder and multi-labeling classification, as the auxiliary supervisions for the table encoder. We test our models on the widely used dataset Wikibio which contains Wikipedia in-foboxes and related descriptions. The dataset contains complex tables as well as large number of attributes across different domains. We achieve the state-of-the-art performance on both automatic and human evaluation metrics.
机译:生成由多个属性值元组组成的结构化表的自然语言描述是帮助人们理解表的便捷方式。大多数神经表到文本模型都基于编码器解码器框架。但是,Vanilla编码器很难了解复杂表的准确语义表示。挑战是两倍:首先,TO的表格到文本数据集包含不同域的大量属性,因此编码器难以包含这些异构资源。其次,单个编码器在建模表的复杂属性值结构方面也具有困难。为此,我们首先提出了一个双层分层编码器,具有粗略关注,以处理表的属性值结构。此外,要捕获表的准确语义表示,我们提出了3个关节任务,除了Prime编码器 - 解码器学习,即辅助序列标记任务,文本AutoEncoder和多标签分类,作为表编码器的辅助监控。我们在广泛使用的DataSet Wikibio上测试我们的模型,其中包含Wikipedia In-foboxes和相关描述。数据集包含复杂的表以及不同域中的大量属性。我们在自动和人类评估指标上实现了最先进的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号