首页> 外文会议>9th International conference on language resources and evaluation >The IULA Spanish LSP Treebank: building and browsing
【24h】

The IULA Spanish LSP Treebank: building and browsing

机译:IULA西班牙LSP树库:构建和浏览

获取原文

摘要

This paper presents the IULA Spanish LSP Treebank, a dependency treebank of over 41,000 sentences of different domains (Law, Economy, Computing Science, Environment, and Medicine), developed in the framework of the European project METANET4U. Dependency annotations in the treebank were automatically derived from manually selected parses produced by an HPSG-grammar by a deterministic conversion algorithm that used the identifiers of grammar rules to identify the heads, the dependents, and some dependency types that were directly transferred onto the dependency structure (e.g., subject, specifier, and modifier), and the identifiers of the lexical entries to identify the argument-related dependency functions (e.g. direct object, indirect object, and oblique complement). The treebank is accessible with a browser that provides concordance-based search functions and delivers the results in two formats: (ⅰ) a column-based format, in the style of CoNLL-2006 shared task, and (ⅱ) a dependency graph, where dependency relations are noted by an oriented arrow which goes from the dependent node to the head node. The IULA Spanish LSP Treebank is the first technical corpus of Spanish annotated at surface syntactic level following the dependency grammar theory. The treebank has been made publicly and freely available from the META-SHARE platform with a Creative Commons CC-by licence.
机译:本文介绍了IULA西班牙语LSP树库,它是在欧洲项目METANET4U的框架下开发的,包含超过41,000个不同域(法律,经济,计算机科学,环境和医学)的句子的依赖树库。树库中的依赖项注释是通过确定性转换算法从HPSG语法生成的手动选择的分析中自动得出的,确定性转换算法使用语法规则的标识符来识别标头,依赖项和某些直接转换为依赖项结构的依赖项类型(例如,主题,说明符和修饰符),以及词汇表条目的标识符,以标识与自变量相关的依赖函数(例如,直接宾语,间接宾语和斜补语)。可以通过浏览器访问树库,该浏览器提供基于一致性的搜索功能,并以两种格式提供结果:(ⅰ)采用CoNLL-2006共享任务样式的基于列的格式,以及(ⅱ)依赖关系图,其中从属节点到头节点之间的定向箭头指示了从属关系。 IULA西班牙语LSP树库是遵循依存语法理论的第一个在表面句法级别上标注的西班牙语技术语料库。该树库已通过知识共享CC许可从META-SHARE平台公开免费提供。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号