首页> 外文会议>Annual meeting of the Association for Computational Linguistics >A Parallel-Hierarchical Model for Machine Comprehension on Sparse Data
【24h】

A Parallel-Hierarchical Model for Machine Comprehension on Sparse Data

机译:稀疏数据机器理解的并行层次模型

获取原文

摘要

Understanding unstructured text is a major goal within natural language processing. Comprehension tests pose questions based on short text passages to evaluate such understanding. In this work, we investigate machine comprehension on the challenging MCTest benchmark. Partly because of its limited size, prior work on MCTest has focused mainly on engineering better features. We tackle the dataset with a neural approach, harnessing simple neural networks arranged in a parallel hierarchy. The parallel hierarchy enables our model to compare the passage, question, and answer from a variety of trainable perspectives, as opposed to using a manually designed, rigid feature set. Perspectives range from the word level to sentence fragments to sequences of sentences; the networks operate only on word-embedding representations of text. When trained with a methodology designed to help cope with limited training data, our Parallel-Hierarchical model sets a new state of the art for MCTest, outperforming previous feature-engineered approaches slightly and previous neural approaches by a significant margin (over 15 percentage points).
机译:了解非结构化文本是自然语言处理的主要目标。理解测验会根据短篇文章提出问题,以评估这种理解。在这项工作中,我们将研究具有挑战性的MCTest基准测试对机器的理解。在一定程度上,由于其尺寸有限,有关MCTest的先前工作主要集中在设计更好的功能上。我们利用神经方法处理数据集,利用排列在并行层次结构中的简单神经网络。平行层次结构使我们的模型能够从各种可训练的角度比较段落,问题和答案,而不是使用手动设计的刚性特征集。视角范围从单词级别到句子片段再到句子序列;网络仅在文字的词嵌入表示上运行。当使用旨在帮助应对有限训练数据的方法进行训练时,我们的并行-层次模型为MCTest设定了新的技术水平,其性能略微优于以前的特征工程方法和明显优于先前的神经方法(超过15个百分点) 。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号