首页> 外文会议>Language processing and intelligent information systems >Mapping Named Entities from NKJP Corpus to Skladnica Treebank and Polish Wordnet
【24h】

Mapping Named Entities from NKJP Corpus to Skladnica Treebank and Polish Wordnet

机译:从NKJP语料库到Skladnica树库和波兰语Wordnet的命名实体映射

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In this paper a method of mapping named entities from NKJP corpus, where their annotation is rather coarse, to Skladnica tree-bank, where their annotation is wordnet-based, is discussed. The method is based on the fact that Skladnica is a subcorpus of the one-million-word manually annotated balanced subcorpus of NKJP. The method to find a corresponding node in a parse tree is presented. Next, several heuristics to match the lemma of an NE in Polish Wordnet and to choose the most probable semantic interpretation of ambiguous ones are suggested. The results of the mapping are evaluated.
机译:本文讨论了一种将命名实体从NKJP语料库(其注释相当粗糙)映射到Skladnica树库(其注释基于词网)的方法。该方法基于以下事实:Skladnica是NKJP的一百万个单词的手动注释的平衡子集的子集。提出了在解析树中找到对应节点的方法。接下来,提出了几种启发式方法,以匹配波兰语Wordnet中NE的引理,并选择最可能的歧义语义解释。评估映射结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号