首页> 外文会议>9th International conference on language resources and evaluation >The IULA Spanish LSP Treebank: building and browsing
【24h】

The IULA Spanish LSP Treebank: building and browsing

机译:西班牙语的IULA西班牙人:建筑和浏览

获取原文

摘要

This paper presents the IULA Spanish LSP Treebank, a dependency treebank of over 41,000 sentences of different domains (Law, Economy, Computing Science, Environment, and Medicine), developed in the framework of the European project METANET4U. Dependency annotations in the treebank were automatically derived from manually selected parses produced by an HPSG-grammar by a deterministic conversion algorithm that used the identifiers of grammar rules to identify the heads, the dependents, and some dependency types that were directly transferred onto the dependency structure (e.g., subject, specifier, and modifier), and the identifiers of the lexical entries to identify the argument-related dependency functions (e.g. direct object, indirect object, and oblique complement). The treebank is accessible with a browser that provides concordance-based search functions and delivers the results in two formats: (ⅰ) a column-based format, in the style of CoNLL-2006 shared task, and (ⅱ) a dependency graph, where dependency relations are noted by an oriented arrow which goes from the dependent node to the head node. The IULA Spanish LSP Treebank is the first technical corpus of Spanish annotated at surface syntactic level following the dependency grammar theory. The treebank has been made publicly and freely available from the META-SHARE platform with a Creative Commons CC-by licence.
机译:本文介绍了IULA西班牙语LSP TreeBank,这是一个在欧洲项目Metanet4u的框架内开发的不同域名(法律,经济,计算科学,环境和医学)超过41,000句的依赖树班币。 TreeBank中的依赖性注释由由HPSG-GRAMMAR产生的手动选定的解析通过确定型转换算法自动派生,该算法使用语法规则的标识符来标识直接转移到依赖结构上的头部,依赖项和一些依赖性类型(例如,主题,说明符和修饰符)以及词汇条目的标识符,以识别与参数相关的依赖函数(例如直接对象,间接对象和倾斜补充)。 TreeBank可以使用浏览器提供提供基于一致的搜索功能,并以两种格式传递结果:(Ⅰ)基于列的格式,符合Conll-2006共享任务的风格,(Ⅱ)依赖图,其中由从属节点到头节点的取向箭头注意依赖关系。 IULA西班牙语LSP TreeBank是在依赖语法理论之后的表面句法水平的第一个西班牙语技术语料库。 TreeBank已经公开,可自由地从Meta-Serve平台上免费提供,具有CC-By许可证。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号