首页> 外文会议>9th International conference on language resources and evaluation >Extracting semantic relations from Portuguese corpora using lexical-syntactic patterns
【24h】

Extracting semantic relations from Portuguese corpora using lexical-syntactic patterns

机译:利用词法语法模式从葡萄牙语中提取语义关系

获取原文

摘要

The growing investment on automatic extraction procedures, together with the need for extensive resources, makes semi-automatic construction a new viable and efficient strategy for developing of language resources, combining accuracy, size, coverage and applicability. These assumptions motivated the work depicted in this paper, aiming at the establishment and use of lexical-syntactic patterns for extracting semantic relations for Portuguese from corpora, part of a larger ongoing project for the semi-automatic extension of WordNet.PT. 26 lexical-syntactic patterns were established, covering hypemymy/hyponymy and holonymy/meronymy relations between nominal items, and over 34 000 contexts were manually analyzed to evaluate the productivity of each pattern. The set of patterns and respective examples are given, as well as data concerning the extraction of relations - right hits, wrong hits and related hits-, as well as the total of occurrences of each pattern in CPRC. Although language-dependent, and thus clearly of obvious interest for the development of lexical resources for Portuguese, the results depicted in this paper are also expected to be helpful as a basis for the establishment of patterns for related languages such as Spanish, Catalan, French or Italian.
机译:对自动提取程序的越来越多的投资以及对广泛资源的需求,使半自动构建成为语言资源的新的可行和有效的战略,结合准确性,尺寸,覆盖率和适用性。这些假设的激励了本文所描述的工作,旨在建立和使用词汇句法模式,用于从Corpora从Corpora提取葡萄牙语的语义关系,这是一个较大的持续项目,用于Wordnet的半自动扩展。建立了26个词汇句法模式,涵盖了低音型/下值和孤象的名义项目之间的孤墓关系,并且手动分析了超过34 000个上下文,以评估每个模式的生产力。给出了一组模式和各个示例,以及关于关系的提取的数据 - 右命令,错误的命中和相关的命中 - 以及CPRC中每个模式的总体发生。虽然依赖语言,但显然对葡萄牙语的词汇资源的发展有明显的兴趣,但本文所描述的结果也有助于作为建立与西班牙语,加泰罗尼亚,法国的相关语言模式的基础的基础或意大利语。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号