Towards the Development of a Hybrid Parser for Natural Languages

Sardar F. Jaf; Allan Ramsay


Abstract

To understand natural languages, we have to be able to determine the relations between words; in other words, we have to be able to 'parse' the input text. This is a difficult task, especially for Arabic, which has a number of properties that make it particularly hard to handle. There are two approaches to parsing natural languages: grammar-driven and data-driven. Each approach poses its own set of problems, which we discuss in this paper. The goal of our work is to produce a hybrid parser that retains the advantages of the data-driven approach but is guided by grammar rules in order to produce more accurate output. The work consists of two stages. The first stage is to develop a baseline data-driven parser, guided by a machine learning algorithm, that establishes dependency relations between words. The second stage is to integrate grammar rules into the baseline parser. In this paper, we describe the first stage, which is now implemented, and a number of experiments conducted on this parser. We also discuss the results of these experiments and highlight the factors that affect parsing speed and the correctness of the parser's output.
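The abstract does not say which parsing algorithm the baseline parser uses. As a rough illustration only, the sketch below assumes an arc-standard shift-reduce scheme, one common design for data-driven dependency parsing, in which a decision function picks the next action at each step. The names `parse` and `make_oracle` are hypothetical, and the gold-tree oracle merely stands in for a trained machine learning classifier.

```python
from typing import Callable, Dict, List

SHIFT, LEFT_ARC, RIGHT_ARC = "shift", "left_arc", "right_arc"
Chooser = Callable[[List[int], List[int], Dict[int, int]], str]


def parse(n_words: int, choose: Chooser) -> Dict[int, int]:
    """Arc-standard parsing. Token 0 is an artificial ROOT; words are 1..n_words.

    Returns a mapping from each word index to the index of its head.
    """
    stack: List[int] = [0]
    buffer: List[int] = list(range(1, n_words + 1))
    heads: Dict[int, int] = {}
    while buffer or len(stack) > 1:
        action = choose(stack, buffer, heads)
        if action == SHIFT and buffer:
            # Move the next input word onto the stack.
            stack.append(buffer.pop(0))
        elif action == LEFT_ARC and len(stack) > 2:
            # Second-from-top becomes a dependent of the top (never ROOT).
            dep = stack.pop(-2)
            heads[dep] = stack[-1]
        elif action == RIGHT_ARC and len(stack) > 1:
            # Top becomes a dependent of the item below it.
            dep = stack.pop()
            heads[dep] = stack[-1]
        else:
            raise ValueError(f"action {action!r} is not valid in this state")
    return heads


def make_oracle(gold: Dict[int, int]) -> Chooser:
    """Hypothetical stand-in for a classifier: reads actions off a gold tree."""
    def choose(stack: List[int], buffer: List[int], heads: Dict[int, int]) -> str:
        if len(stack) >= 2:
            top, below = stack[-1], stack[-2]
            if below != 0 and gold[below] == top:
                return LEFT_ARC
            # Attach top to the right only once it has all its own dependents.
            done = all(heads.get(d) == top for d, h in gold.items() if h == top)
            if gold[top] == below and done:
                return RIGHT_ARC
        return SHIFT
    return choose


# "She reads books": 1=She, 2=reads, 3=books; "reads" is attached to ROOT (0).
gold_heads = {1: 2, 2: 0, 3: 2}
predicted = parse(3, make_oracle(gold_heads))
print(predicted == gold_heads)
```

In a data-driven parser, `choose` would be a model trained on treebank states rather than a function that looks at the gold tree. A hybrid parser of the kind the abstract proposes could additionally veto or re-rank actions that violate grammar rules, although how the authors do this is not described here.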


Bibliographic details

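Source: OASIcs : OpenAccess Series in Informatics, 2013, Issue 1, 8 pages
Also listed as a conference paper: Imperial College Computing Student Workshop, 2013
Authors: Sardar F. Jaf; Allan Ramsay
Original format: PDF
Chinese Library Classification: computing technology, computer technology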

