首页> 外文会议>Workshop on Innovative Use of NLP for Building Educational Applications >Maximizing Classification Accuracy in Native Language Identification

【24h】

Maximizing Classification Accuracy in Native Language Identification

机译：最大化母语识别中的分类准确性

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper reports our contribution to the 2013 NLI Shared Task. The purpose of the task was to train a machine-learning system to identify the native-language affiliations of 1,100 texts written in English by nonnative speakers as part of a high-stakes test of general academic English proficiency. We trained our system on the new TOEFL 11 corpus, which includes 11,000 essays written by nonnative speakers from 11 native-language backgrounds. Our final system used an SVM classifier with over 400,000 unique features consisting of lexical and POS n-grams occurring in at least two texts in the training set. Our system identified the correct native-language affiliations of 83.6% of the texts in the test set. This was the highest classification accuracy achieved in the 2013 NLI Shared Task.

机译：本文向2013年NLI共享任务报告了我们对2013年的贡献。该任务的目的是培训机器学习系统，以识别非营利扬声器用英语编写的1,100个文本的本土语言隶属关系，作为一般学术英语水平的高赌注测试的一部分。我们在新托福11个语料库上培训了我们的系统，其中包括来自11个母语背景的非扬声器编写的11,000名论文。我们的最终系统使用了SVM分类器，其中包含超过400,000个独特功能，包括在训练集中至少有两个文本中发生的词汇和POS N-GRAM。我们的系统确定了测试集中的最正确的本地语言隶属度为83.6％的文本。这是2013年NLI共享任务所取得的最高分类准确性。

著录项

来源
《Workshop on Innovative Use of NLP for Building Educational Applications 》|2013年||共8页
会议地点
作者
Scott Jarvis; Yves Bestgen; Steve Pepper;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程 ;
关键词

相似文献

外文文献
中文文献
专利

1. 语言测试中的语音识别技术——在已英语为母语的环境中参加测试重要吗？ [J] . Marina DODIGOVIC 中国应用语言学：英文版 . 2015 ,第003期
2. A Pre-classification-Based Language Identification for Northeast Indian Languages Using Prosody and Spectral Features [J] . Bhanja Chuya China, Laskar Mohammad Azharuddin, Laskar Rabul Hussain Circuits, systems, and signal processing . 2019 ,第5期

机译：基于韵律和谱特征的东北印度语言基于分类的语言识别
3. Native Language Identification of Fluent and Advanced Non-Native Writers [J] . Sarwar Raheem, Rutherford Attapol T., Hassan Saeed-Ul, ACM transactions on Asian and low-resource language information processing . 2020 ,第4期

机译：流利和先进的非本土作家的母语识别
4. Vowel identification in temporal-modulated noise for native and non-native listeners: Effect of language experience [J] . Guan Jingjing, Liu Chang, Tao Sha, The Journal of the Acoustical Society of America . 2015 ,第3aPta1期

机译：本地和非本地听众在时间调制噪声中的元音识别：语言体验的影响
5. Maximizing Classification Accuracy in Native Language Identification [C] . Scott Jarvis, Yves Bestgen, Steve Pepper Workshop on Innovative Use of NLP for Building Educational Applications . 2013

机译：在母语识别中最大程度地提高分类精度
6. Examining the Validity of Classifications from an English Language Proficiency Assessment for English Language Learners and Native English Speakers in Fifth Grade. [D] . Carroll, Patricia Elaine. 2012

机译：从英语水平评估中对英语学习者和五年级英语母语者进行分类的有效性检验。
7. A Classification of Bioinformatics Algorithms from the Viewpoint of Maximizing Expected Accuracy (MEA) [O] . Michiaki Hamada, Kiyoshi Asai -1

机译：从最大化期望准确性（MEA）的角度对生物信息学算法进行分类
8. Can characters reveal your native language? A language-independent approach to native language identification [O] . Radu Tudor Ionescu, Marius Popescu, Aoife Cahill 2015

机译：角色可以揭示您的母语吗？与语言无关的本地语言识别方法

Maximizing Classification Accuracy in Native Language Identification

摘要

著录项

相似文献

相关主题

期刊订阅