Creating a manually error-tagged and shallow-parsed learner corpus

Abstract

The availability of learner corpora, especially those which have been manually error-tagged or shallow-parsed, is still limited. This means that researchers do not have a common development and test set for natural language processing of learner English such as for grammatical error detection. Given this background, we created a novel learner corpus that was manually error-tagged and shallow-parsed. This corpus is available for research and educational purposes on the web. In this paper, we describe it in detail together with its data-collection method and annotation schemes. Another contribution of this paper is that we take the first step toward evaluating the performance of existing POS-tagging/chunking techniques on learner corpora using the created corpus. These contributions will facilitate further research in related areas such as grammatical error detection and automated essay scoring.

Bibliographic details

Source
Annual Meeting of the Association for Computational Linguistics (ACL 2011) | 2012 | pp. 1210-1219 | 10 pages
Authors
Ryo Nagata; Edward Whittaker; Vera Sheinman
Original format: PDF
Chinese Library Classification: Programming, software engineering

Similar literature

1. L2 English Learners' Performance in Persuasion Role-Plays: A Learner-Corpus-Based Study [J]. Shinichiro Ishikawa. International Journal of Computer-Assisted Language Learning and Teaching. 2021, No. 2
2. CityU corpus of essay drafts of English language learners: a corpus of textual revision in second language writing [J]. Lee John, Yeung Chak Yan, Zeldes Amir. Language Resources and Evaluation. 2015, No. 3
3. The Comparison of Collocation Use by Turkish and Asian Learners of English: The Case of TCSE Corpus and Icnale Corpus [J]. Elif Tokdemir Demirel, Semin Kazazoğlu. Procedia - Social and Behavioral Sciences. 2015, No. 1
4. Creating a manually error-tagged and shallow-parsed learner corpus [C]. Ryo Nagata, Edward Whittaker, Vera Sheinman. Annual Meeting of the Association for Computational Linguistics. 2011
5. A Contrastive Corpus Analysis on the Use of Connectors in Students’ Writing from 10 Asian Countries as Compared to Native Experts: Research from the ICNALE (The International Corpus Network of Asian Learners of English) [D]. Cho Min, Hyun Soon. 2020
6. Using Desktop Publishing to Create Newsletters Handouts and Web Pages: A How-To-Do-It Manual and Library Public Relations Promotions and Communications: A How-To-Do-It Manual [O]. John A. Sable. 1998
7. Capturing L2 accuracy developmental patterns: Insights from an error-tagged EFL learner corpus [O]. Thewissen Jennifer. 2011
