首页> 外文会议>Thirteenth workshop on innovative use of NLP for building educational applications 2018 >NT2Lex: A CEFR-Graded Lexical Resource for Dutch as a Foreign Language Linked to Open Dutch WordNet
【24h】

NT2Lex: A CEFR-Graded Lexical Resource for Dutch as a Foreign Language Linked to Open Dutch WordNet

机译:NT2Lex:CEFR分级的词汇资源,用于将荷兰语作为外语链接到开放式荷兰词网

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we introduce NT2Lex, a novel lexical resource for Dutch as a foreign language (NT2) which includes frequency distributions of 17,743 words and expressions attested in expert-written textbook texts and readers graded along the scale of the Common European Framework of Reference (CEFR). In essence, the lexicon informs us about what kind of vocabulary should be understood when reading Dutch as a non-native reader at a particular proficiency level. The main novelty of the resource with respect to the previously developed CEFR-graded lexicons concerns the introduction of corpus-based evidence for L2 word sense complexity through the linkage to Open Dutch WordNet (Postma et al., 2016). The resource thus contains, on top of the lemmatised and part-of-speech tagged lexical entries, a total of 11,999 unique word senses and 8.934 distinct synsets.
机译:在本文中,我们介绍NT2Lex,这是一种新型的荷兰语作为外语(NT2)的词汇资源,其中包括17,743个单词和表达式的频率分布,这些单词和表达式在专家编写的教科书文本中得到证明,并且读者按照《欧洲参考标准》的等级进行分级(CEFR)。本质上,该词典告知我们在特定水平的荷兰语作为非母语读者阅读时应理解的词汇表。与先前开发的CEFR分级词典相关的资源的主要新颖之处在于,通过与开放荷兰WordNet的链接,引入了基于语料库的L2词义复杂性证据(Postma等,2016)。因此,该资源在经过修饰词和词性标记的词法条目的顶部,总共包含11,999个唯一的词义和8.934个不同的同义词集。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号