首页> 外文会议>International Conference on Statistical Language and Speech Processing >A Cognitive Approach to Parsing with Neural Networks
【24h】

A Cognitive Approach to Parsing with Neural Networks

机译:用神经网络解析的认知方法

获取原文
获取外文期刊封面目录资料

摘要

According to Cognitive Grammar (CG) theory, the overall structure of a natural language is motivated by a relatively small set of domain-independent cognitive abilities. In this paper, we draw insights from CG to propose an approach to natural language parsing with little syntactic annotation. A sentence functions as a cohesive whole because its parts are meaningfully linked. We propose that every part of a sentence can be analysed along three axes: composition, interaction and autonomy. When two expressions semantically correspond in all the three axes we call them cohesive. We present an algorithm that reads parts of sentences incrementally, recognises their construction schemas along the three axes, assembles any two component schemas into one composite schema if they are cohesive, parses a span of text as incrementally successive assembly of components into composites, retains multiple running parses within the span and chooses the best parse. The basic construction schema definitions and their patterns of assembly are implemented as dictionary-cum-rules because they are fewer in number, largely language-independent and can be extended to handle language-specific variations. A basic feedforward neural network component was trained to learn all valid patterns of assemblies possible in a span of text and to choose the best parse. A successful parse exhausts all the words in the sentence and ensures local cohesion and assembly at every stage of analysis. We present our approach, parser implementation and evaluation results in Welsh and English. By adding WordNet synsets we are able to show improvements in parser performance.
机译:根据认知语法(CG)理论,自然语言的整体结构是由相对较小的域的独立认知能力的激励。在本文中,我们吸引了CG的见解,提出了一种用很少的句法注释解析自然语言的方法。一个句子用作凝聚力的整体,因为它的部件有意义地链接。我们建议可以沿三个轴分析句子的每个部分:构成,互动和自主权。当两个表达式在所有三个轴上语义上对应时,我们称之为凝聚力。我们呈现了一种算法逐步读取部分句子,沿三个轴识别其构造模式,如果它们是凝聚力的,将任意两个组件模式组装成一个复合模式,将文本的跨度解析为复合材料的组件递增连续组装,保留多个在跨度内运行parses并选择最好的解析。基本构造模式定义及其组装模式在字典中实现为Quic-Rule,因为它们的数量较少,很大程度上是语言无关的,并且可以扩展到处理特定语言的变体。培训基本的前馈神经网络组件,以了解文本跨度中可能的所有有效组件模式,并选择最佳解析。成功的解析排除了句子中的所有单词,并在每个分析阶段确保局部凝聚力和组装。我们介绍了我们的方法,解析器实施和评估结果在威尔士语和英语中。通过添加WordNet Synpsets,我们能够显示Parser性能的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号