【24h】

The MERLIN corpus: Learner language and the CEFR

机译:MERLIN语料库:学习者语言和CEFR

获取原文

摘要

The MERLIN corpus is a written learner corpus for Czech, German, and Italian that has been designed to illustrate the Common European Framework of Reference for Languages (CEFR) with authentic learner data. The corpus contains 2,290 learner texts produced in standardized language certifications covering CEFR levels Al-Cl. The MERLIN annotation scheme includes a wide range of language characteristics that enable research into the empirical foundations of the CEFR scales and provide language teachers, test developers, and Second Language Acquisition researchers with concrete examples of learner performance and progress across multiple proficiency levels. For computational linguistics, it provide a range of authentic learner data for three target languages, supporting a broadening of the scope of research in areas such as automatic proficiency classification or native language identification. The annotated corpus and related information will be freely available as a corpus resource and through a freely accessible, didactically-oriented online platform.
机译:MERLIN语料库是针对捷克语,德语和意大利语的书面学习者语料库,旨在利用真实的学习者数据来说明《欧洲共同语言参考框架》(CEFR)。语料库包含2290种通过标准化语言认证产生的学习者文章,涵盖CEFR等级Al-Cl。 MERLIN注释方案包含广泛的语言特征,这些特征使我们能够研究CEFR量表的经验基础,并为语言老师,测试开发人员和第二语言习得研究人员提供学习者在多个熟练水平上的表现和进步的具体示例。对于计算语言学,它为三种目标语言提供了一系列可靠的学习者数据,从而支持了自动熟练分类或母语识别等领域的研究范围的扩大。带注释的语料库和相关信息将作为语料库资源,并通过可免费访问的,面向教学的在线平台免费提供。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号