首页> 外文会议>International Workshop on Computational Processing of the Portuguese Language >Automatic Semantic Role Labeling on Non-revised Syntactic Trees of Journalistic Texts
【24h】

Automatic Semantic Role Labeling on Non-revised Syntactic Trees of Journalistic Texts

机译:关于新闻文本未修改的句法树的自动语义角色标记

获取原文

摘要

Semantic Role Labeling (SRL) is a Natural Language Processing task that enables the detection of events described in sentences and the participants of these events. For Brazilian Portuguese (BP), there are two studies recently concluded that perform SRL in journalistic texts. [1] obtained F1-measure scores of 79.6, using the Prop-Bank.Br corpus, which has syntactic trees manually revised; [8], without using a treebank for training, obtained F1-measure scores of 68.0 for the same corpus. However, the use of manually revised syntactic trees for this task does not represent a real scenario of application. The goal of this paper is to evaluate the performance of SRL on revised and non-revised syntactic trees using a larger and balanced corpus of BP journalistic texts. First, we have shown that [1]'s system also performs better than [8]'s system on the larger corpus. Second, the SRL system trained on non-revised syntactic trees performs better over non-revised trees than a system trained on gold-standard data.
机译:语义角色标记(SRL)是一种自然语言处理任务,可以检测句子中描述的事件和这些事件的参与者。对于巴西葡萄牙语(BP),最近有两项研究得出结论,在新闻文本中进行SRL。 [1]获得F1测量分数79.6,使用ProC-Bank.BR语料库,其手动修改句法树木; [8],不使用TreeBank进行培训,为相同的语料库获得68.0的F1测量分数。但是,为此任务使用手动修订的语法树不代表应用程序的实际情况。本文的目标是使用BP新闻文本的较大和平衡的语料库来评估SRL对修订和未修改的句法树的性能。首先,我们已经表明,[1]系统也比较大的语料库上的系统更好地表现优于[8]。其次,在非修订的句法树上培训的SRL系统在非修订的树上培训比在金标准数据上培训的系统更好地执行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号