首页> 外文会议>International Workshop on Computational Processing of the Portuguese Language >An Account of the Challenge of Tagging a Reference Corpus for Brazilian Portuguese
【24h】

An Account of the Challenge of Tagging a Reference Corpus for Brazilian Portuguese

机译:对巴西葡萄牙语标记参考语料库的挑战的说法

获取原文

摘要

This article identifies and addresses the major linguistic/conceptual, as opposed to logistic, issues faced in the morphosyntactic tagging of MAC-Morpho, a 1.1 million word Brazilian Portuguese corpus of newspaper articles that has been developed in the Lacio-Web Project. Rather than simply presenting the annotated corpus and describing its tagset, we elaborate on the criteria for establishing the tagset and analyze some interesting cases amongst the linguistic problems we faced in this work.
机译:本文确定并解决了主要的语言/概念,而不是物流,在Lac-Web项目中的报纸文章的110万字巴西葡萄牙语中面临的逻辑问题面临的问题,这是在Lacio-Web项目中开发的。我们不是简单地介绍注释的语料库并描述其TAGSET,我们详细阐述了建立标签的标准,并在我们面临的语言问题中分析一些有趣的情况。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号