首页> 外文期刊>Expert Systems with Application >An attentive neural architecture for joint segmentation and parsing and its application to real estate ads
【24h】

An attentive neural architecture for joint segmentation and parsing and its application to real estate ads

机译:用于联合分割和解析的细心神经架构及其在房地产广告中的应用

获取原文
获取原文并翻译 | 示例
       

摘要

In processing human produced text using natural language processing (NLP) techniques, two fundamental subtasks that arise are (i) segmentation of the plain text into meaningful subunits (e.g., entities), and (ii) dependency parsing, to establish relations between subunits. Such structural interpretation of text provides essential building blocks for upstream expert system tasks: e.g., from interpreting textual real estate ads, one may want to provide an accurate price estimate and/or provide selection filters for end users looking for a particular property - which all could rely on knowing the types and number of rooms, etc. In this paper, we develop a relatively simple and effective neural joint model that performs both segmentation and dependency parsing together, instead of one after the other as in most state-of-the-art works. We will focus in particular on the real estate ad setting, aiming to convert an ad to a structured description, which we name property tree, comprising the tasks of (1) identifying important entities of a property (e.g., rooms) from classifieds and (2) structuring them into a tree format. In this work, we propose a new joint model that is able to tackle the two tasks simultaneously and construct the property tree by (i) avoiding the error propagation that would arise from the subtasks one after the other in a pipelined fashion, and (ii) exploiting the interactions between the subtasks. For this purpose, we perform an extensive comparative study of the pipeline methods and the new proposed joint model, reporting an improvement of over three percentage points in the overall edge F-1 score of the property tree. Also, we propose attention methods, to encourage our model to focus on salient tokens during the construction of the property tree. Thus we experimentally demonstrate the usefulness of attentive neural architectures for the proposed joint model, showcasing a further improvement of two percentage points in edge F-1 score for our application. While the results demonstrated are for the particular real estate setting, the model is generic in nature, and thus could be equally applied to other expert system scenarios requiring the general tasks of both (i) detecting entities (segmentation) and (ii) establishing relations among them (dependency parsing). (C) 2018 Elsevier Ltd. All rights reserved.
机译:在使用自然语言处理(NLP)技术处理人为生成的文本时,出现了两个基本子任务:(i)将纯文本分割为有意义的子单元(例如实体),以及(ii)依赖项解析,以建立子单元之间的关系。这种文本的结构化解释为上游专家系统任务提供了必要的构建块:例如,从解释文本房地产广告开始,人们可能希望提供准确的价格估算和/或为寻找特定财产的最终用户提供选择过滤器-所有这些可能依赖于了解房间的类型和数量等。在本文中,我们开发了一个相对简单有效的神经联合模型,该模型可以同时执行分段和依存关系解析,而不是像大多数情况一样一个接一个地进行解析艺术作品。我们将特别关注房地产广告设置,旨在将广告转换为结构化描述,我们将其命名为属性树,包括以下任务:(1)从分类中识别属性的重要实体(例如房间),以及( 2)将它们构造成树格式。在这项工作中,我们提出了一种新的联合模型,该模型能够同时解决这两个任务并通过(i)避免以流水线方式接连出现子任务而引起的错误传播,以及(ii )利用子任务之间的互动。为此,我们对流水线方法和新提出的联合模型进行了广泛的比较研究,报告了属性树的整体边缘F-1得分提高了三个百分点以上。此外,我们提出了注意方法,以鼓励我们的模型在构造属性树期间将重点放在显着标记上。因此,我们通过实验证明了注意神经体系结构对于所提出的联合模型的有用性,展示了边缘F-1分数在我们的应用中进一步提高了两个百分点。虽然所显示的结果是针对特定房地产设置的,但该模型本质上是通用的,因此可以同等地应用于需要(i)检测实体(细分)和(ii)建立关系两者的一般任务的其他专家系统方案其中(依赖项解析)。 (C)2018 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号