首页> 外文会议>International conference on language resources and evaluation >Combining Language Resources Into A Grammar-Driven Swedish Parser
【24h】

Combining Language Resources Into A Grammar-Driven Swedish Parser

机译:将语言资源合并为语法驱动的瑞典语解析器

获取原文

摘要

This paper describes work on a rule-based, open-source parser for Swedish. The central component is a wide-coverage grammar implemented in the GF formalism (Grammatical Framework), a dependently typed grammar formalism based on Martin-L6f type theory. GF has strong support for multilinguality and has so far been used successfully for controlled languages (Angelov and Ranta, 2009) and recent experiments have showed that it is also possible to use the framework for parsing unrestricted language. In addition to GF, we use two other main resources: the Swedish treebank Talbanken and the electronic lexicon SALDO. By combining the grammar with a lexicon extracted from SALDO we obtain a parser accepting all sentences described by the given rules. We develop and test this on examples from Talbanken. The resulting parser gives a full syntactic analysis of the input sentences. It will be highly reusable, freely available, and as GF provides libraries for compiling grammars to a number of programming languages, chosen parts of the the grammar may be used in various NLP applications.
机译:本文介绍了针对瑞典语的基于规则的开放源代码解析器的工作。核心部分是在GF形式主义(语法框架)中实施的广泛语法,GF形式主义是基于Martin-L6f类型理论的从属类型语法形式。 GF对多种语言提供了强大的支持,并且迄今为止已成功地用于受控语言(Angelov和Ranta,2009),最近的实验表明,也可以使用该框架来解析不受限制的语言。除GF外,我们还使用其他两个主要资源:瑞典树库Talbanken和电子词典SALDO。通过将语法与从SALDO中提取的词典进行组合,我们获得了一个解析器,该解析器接受给定规则描述的所有句子。我们根据塔尔班肯(Talbanken)的示例进行开发和测试。结果分析器对输入句子进行完整的句法分析。它将高度可重用,免费提供,并且由于GF提供了用于将语法编译为多种编程语言的库,因此语法的选定部分可用于各种NLP应用程序中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号