首页> 外文期刊>Language Resources and Evaluation >Automatic acquisition of syntactic verb classes with basic resources
【24h】

Automatic acquisition of syntactic verb classes with basic resources

机译:使用基本资源自动获取句法动词类

获取原文
获取原文并翻译 | 示例
       

摘要

This paper describes a methodology aimed at grouping Catalan verbs according to their syntactic behavior. Our goal is to acquire a small number of basic classes with a high level of accuracy, using minimal resources. Information on syntactic class, expensive and slow to compile by hand, is useful for any NLP task requiring specific lexical information. We show that it is possible to acquire this kind of information using only a POS-tagged corpus. We perform two clustering experiments. The first one aims at classifying verbs into transitive, intransitive and verbs alternating with a se-construction. Our system achieves an average 0.84 F-score, for a task with a 0.33 baseline. The second experiment aims at further distinguishing among pure intransitives and verbs bearing a prepositional object. The baseline for the task is 0.51 and the upperbound 0.98. The system achieves an average 0.88 F-score.
机译:本文介绍了一种旨在根据加泰罗尼亚语动词的句法行为对其进行分组的方法。我们的目标是使用最少的资源来获取少量高精度的基本类。有关语法类的信息(昂贵且手工编写较慢)对于需要特定词汇信息的任何NLP任务很有用。我们表明,仅使用带有POS标签的语料库就可以获得这种信息。我们执行两个聚类实验。第一个目的是将动词分为与se构式交替的和物,不及物和动词。对于以0.33为基准的任务,我们的系统平均获得0.84 F分数。第二个实验旨在进一步区分纯不及物动词和带有介词宾语的动词。任务的基准为0.51,上限为0.98。该系统平均获得0.88 F分数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号