...
首页> 外文期刊>OASIcs : OpenAccess Series in Informatics >Towards the Development of a Hybrid Parser for Natural Languages
【24h】

Towards the Development of a Hybrid Parser for Natural Languages

机译:面向自然语言的混合解析器的发展

获取原文
   

获取外文期刊封面封底 >>

       

摘要

In order to understand natural languages, we have to be able to determine the relations between words, in other words we have to be able to 'parse' the input text. This is a difficult task, especially for Arabic, which has a number of properties that make it particularly difficult to handle. There are two approaches to parsing natural languages: grammar-driven and data-driven. Each of these approaches poses its own set of problems, which we discuss in this paper. The goal of our work is to produce a hybrid parser, which retains the advantages of the data-driven approach but is guided by grammar rules in order to produce more accurate output. This work consists of two stages: the first stage is to develop a baseline data-driven parser, which is guided by a machine learning algorithm for establishing dependency relations between words. The second stage is to integrate grammar rules into the baseline parser. In this paper, we describe the first stage of our work, which is now implemented, and a number of experiments that have been conducted on this parser. We also discuss the result of these experiments and highlight the different factors that are affecting parsing speed and the correctness of the parser results.
机译:为了理解自然语言,我们必须能够确定单词之间的关系,换句话说,我们必须能够“解析”输入文本。这是一项艰巨的任务,尤其是对于阿拉伯语而言,阿拉伯语具有许多使其难以处理的特性。解析自然语言有两种方法:语法驱动和数据驱动。这些方法中的每一种都有其自身的问题集,我们将在本文中进行讨论。我们工作的目标是产生一个混合解析器,该解析器保留了数据驱动方法的优点,但以语法规则为指导以产生更准确的输出。这项工作包括两个阶段:第一个阶段是开发基线数据驱动的解析器,该解析器由机器学习算法指导以建立单词之间的依赖关系。第二阶段是将语法规则集成到基线解析器中。在本文中,我们描述了我们的工作的第一阶段(现已实施)以及在此解析器上进行的大量实验。我们还将讨论这些实验的结果,并突出显示影响解析速度和解析器结果正确性的不同因素。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号