首页> 外文会议>Brazilian Symposium in Information and Human Language Technology >Mac-Morpho Revisited: Towards Robust Part-of-Speech Tagging
【24h】

Mac-Morpho Revisited: Towards Robust Part-of-Speech Tagging

机译:重新审视Mac-Morpho:迈向强大的词性标记

获取原文

摘要

We present a revision of Mac-Morpho, the biggest corpus of Portuguese text containing manually annotated POS tags. Many errors were corrected, yielding a much more reliable resource. We also trained a neural network based classifier for the POS tagging task, following an architecture that achieves state-of-the-art results in English. Our tagger maps each word to a real valued vector and uses it as input, thus dealing with abstract features. These vectors are induced by distributional semantics techniques, and provide the tagger with information for achieving 96.48% accuracy.
机译:我们提出了Mac-Morpho的修订版,Mac-Morpho是最大的葡萄牙语文本语料库,其中包含手动注释的POS标签。更正了许多错误,从而产生了更加可靠的资源。我们还遵循一种以英语实现最新结果的体系结构,为POS标记任务训练了基于神经网络的分类器。我们的标记器将每个单词映射到一个实值向量,并将其用作输入,从而处理抽象特征。这些向量是由分布语义技术引入的,并为标记器提供了达到96.48%准确性的信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号