【24h】

Robust Morphological Tagging with Word Representations

机译:带有词表示的强大形态标记

获取原文
获取外文期刊封面目录资料

摘要

We present a comparative investigation of word representations for part-of-speech (POS) and morphological tagging, focusing on scenarios with considerable differences between training and test data where a robust approach is necessary. Instead of adapting the model towards a specific domain we aim to build a robust model across domains. We developed a test suite for robust tagging consisting of six languages and different domains. We find that representations similar to Brown clusters perform best for POS tagging and that word representations based on linguistic morphological analyzers perform best for morphological tagging.
机译:我们对词性(POS)和词法标记的词表示形式进行了比较研究,重点研究了在需要鲁棒方法的情况下,训练数据和测试数据之间存在显着差异的场景。与其针对特定领域调整模型,我们的目标是跨领域构建健壮的模型。我们开发了一套用于六种语言和不同域的健壮标记测试套件。我们发现,类似于Brown簇的表示形式最适合POS标记,而基于语言形态分析器的单词表示形式最适合于形态标记。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号