Non-size increasing graph rewriting for natural language processing

GUILLAUME BONFANTE; BRUNO GUILLAUME

首页> 外文期刊>Mathematical structures in computer science >Non-size increasing graph rewriting for natural language processing

【24h】

Non-size increasing graph rewriting for natural language processing

机译：用于自然语言处理的非大小增加图形重写

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

A very large amount of work in Natural Language Processing (NLP) use tree structure as the first class citizen mathematical structures to represent linguistic structures, such as parsed sentences or feature structures. However, some linguistic phenomena do not cope properly with trees; for instance, in the sentence ‘Max decides to leave,’ ‘Max’ is the subject of the both predicates ‘to decide’ and ‘to leave.’ Tree-based linguistic formalisms generally use some encoding to manage sentences like the previous example. In former papers (Bonfante et al. 2011; Guillaume and Perrier 2012), we discussed the interest to use graphs rather than trees to deal with linguistic structures, and we have shown how Graph Rewriting could be used for their processing, for instance in the transformation of the sentence syntax into its semantics. Our experiments have shown that Graph Rewriting applications to NLP do not require the full computational power of the general Graph Rewriting setting. The most important observation is that all graph vertices in the final structures are in some sense ‘predictable’ from the input data, and so we can consider the framework of Non-size increasing Graph Rewriting. In our previous papers, we have formally described the Graph Rewriting calculus we used and our purpose here is to study the theoretical aspect of termination with respect to this calculus. Given that termination is undecidable in general, we define termination criterions based on weight, we prove the termination of weighted rewriting systems, and we give complexity bounds on derivation lengths for these rewriting systems.

机译：自然语言处理（NLP）中的大量工作使用树结构作为一类公民数学结构来表示语言结构，例如解析的句子或特征结构。但是，某些语言现象无法适当地应付树木。例如，在句子“麦克斯决定离开”中，“麦克斯”是谓词“决定”和“离开”的主语。基于树的语言形式主义通常使用某种编码来管理句子，如上例所示。在以前的论文中（Bonfante等人，2011； Guillaume和Perrier，2012），我们讨论了使用图而不是树来处理语言结构的兴趣，并且我们展示了如何使用图重写来处理它们，例如将句子语法转换为其语义。我们的实验表明，将图形重写应用到NLP并不需要常规图形重写设置的全部计算能力。最重要的观察结果是，最终结构中的所有图形顶点在某种意义上都可以根据输入数据“预测”，因此我们可以考虑使用不加大小图形重写的框架。在我们以前的论文中，我们正式描述了我们使用的“图形重写”演算，我们的目的是研究与该演算有关的终止的理论方面。鉴于通常无法确定终止，我们基于权重定义终止标准，我们证明了加权重写系统的终止，并给出了这些重写系统的派生长度的复杂性界限。

著录项

来源
《Mathematical structures in computer science》 |2018年第8期|1451-1484|共34页
作者
GUILLAUME BONFANTE; BRUNO GUILLAUME;
展开▼
作者单位

LORIA/INRIA-BP239, 615 Rue du Jardin-Botanique, 54506 Vandoeuvre-l`es-Nancy, France;

LORIA/INRIA-BP239, 615 Rue du Jardin-Botanique, 54506 Vandoeuvre-l`es-Nancy, France;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. An Arabic natural language interface for querying relational databases based on natural language processing and graph theory methods [J] . Hanane Bais, Mustapha Machkour, Lahcen Koutti International journal of reasoning-based intelligent systems . 2018,第2期

机译：基于自然语言处理和图论方法的阿拉伯自然语言界面查询关系数据库
2. Functional language processing via term rewriting system and its relation to graph transformation system [J] . Yoshio Sugito 電子情報通信学会技術研究報告. ソフトウェアサイエンス. Software Science . 2000,第185期

机译：通过术语重写系统进行功能语言处理及其与图形转换系统的关系
3. Functional language processing via term rewriting system and its relation to graph transformation system [J] . Yoshio Sugito 電子情報通信学会技術研究報告. ソフトウェアサイエンス. Software Science . 2000,第185期

机译：通过术语重写系统进行功能语言处理及其与图形转换系统的关系
4. RQUERY: Rewriting Natural Language Queries on Knowledge Graphs to Alleviate the Vocabulary Mismatch Problem [C] . Saeedeh Shekarpour, Edgard Marx, Soren Auer, AAAI Conference on Artificial Intelligence . 2017

机译：rQuery：重写知识图表的自然语言查询，以缓解词汇错配问题
5. Practical natural language processing question answering using graphs. [D] . Fuchs, Gil Emanuel. 2004

机译：实用的自然语言处理，使用图表进行答疑。
6. Controlled Vocabularies Indexing and Medical Language Processing. Medical Language Processing: Database Capture of Natural Language Echocardiographic Reports: A Unified Medical Language System Approach [O] . K. Canfield, B. Bray, S. Huff, 1989

机译：受控词汇表索引编制和医学语言处理。医学语言处理：自然语言超声心动图报告的数据库捕获：统一医学语言系统方法
7. Cooperating rewrite processes for natural-language analysis [O] . Filgueiras Miguel 1986

机译：配合重写过程进行自然语言分析

Non-size increasing graph rewriting for natural language processing

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅