Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

NT2Lex: A CEFR-Graded Lexical Resource for Dutch as a Foreign Language Linked to Open Dutch WordNet


Abstract

In this paper, we introduce NT2Lex, a novel lexical resource for Dutch as a foreign language (NT2) which includes frequency distributions of 17,743 words and expressions attested in expert-written textbook texts and readers graded along the scale of the Common European Framework of Reference (CEFR). In essence, the lexicon indicates which vocabulary a non-native reader of Dutch should be able to understand at a particular proficiency level. The main novelty of the resource with respect to previously developed CEFR-graded lexicons is the introduction of corpus-based evidence for L2 word sense complexity through the linkage to Open Dutch WordNet (Postma et al., 2016). The resource thus contains, on top of the lemmatised and part-of-speech tagged lexical entries, a total of 11,999 unique word senses and 8,934 distinct synsets.
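
To make the idea of a CEFR-graded, sense-linked lexicon concrete, the following is a minimal sketch of how such entries might be represented and queried. It is not the actual NT2Lex file format or API: the `LexEntry` class, its field names, the synset identifiers, and all frequency values are hypothetical and invented purely for illustration. Only the six CEFR level labels are standard.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional

# The six standard CEFR levels, ordered from lowest to highest.
CEFR_LEVELS = ["A1", "A2", "B1", "B2", "C1", "C2"]


@dataclass
class LexEntry:
    """One hypothetical lexicon row: a lemma/POS pair tied to one word sense."""
    lemma: str
    pos: str
    synset_id: Optional[str]  # made-up identifier, not a real WordNet ID
    freq_by_level: Dict[str, float] = field(default_factory=dict)


def first_attested_level(entry: LexEntry) -> Optional[str]:
    """Return the lowest CEFR level with a non-zero frequency, if any."""
    for level in CEFR_LEVELS:
        if entry.freq_by_level.get(level, 0.0) > 0.0:
            return level
    return None


def attested_up_to(entries: List[LexEntry], level: str) -> List[LexEntry]:
    """Return entries first attested at or below the given CEFR level."""
    cutoff = CEFR_LEVELS.index(level)
    selected = []
    for entry in entries:
        first = first_attested_level(entry)
        if first is not None and CEFR_LEVELS.index(first) <= cutoff:
            selected.append(entry)
    return selected


# Toy data with invented frequencies (NOT taken from NT2Lex).
TOY_ENTRIES = [
    LexEntry("huis", "noun", "toy-synset-1", {"A1": 120.0, "A2": 95.0}),
    LexEntry("bank", "noun", "toy-synset-2", {"A2": 40.0, "B1": 30.0}),  # "sofa"
    LexEntry("bank", "noun", "toy-synset-3", {"B1": 25.0, "B2": 50.0}),  # "financial institution"
]

if __name__ == "__main__":
    for e in attested_up_to(TOY_ENTRIES, "A2"):
        print(e.lemma, e.synset_id, first_attested_level(e))
```

In this sketch, the two senses of "bank" are separate entries with their own frequency distributions, which mirrors the abstract's point that linking entries to synsets allows complexity to be assessed per word sense rather than per word form.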