Improving Automatic Essay Scoring for Indonesian Language using Simpler Model and Richer Feature

Rian Adam Rajagede

Abstract

Automatic essay scoring is a machine learning task in which a model automatically assesses students' essay answers. It is most useful when answers must be assessed at a large scale, where manual grading by humans becomes problematic. In 2019, the Ukara dataset was released for automatic essay scoring in the Indonesian language. The best previously published model on this dataset achieves an F1-score of 0.821 using pre-trained fastText sentence embeddings and a stacking model that combines a neural network with XGBoost. In this study, we propose a simpler classifier, a neural network with a single hidden layer, paired with a richer feature: BERT sentence embeddings. The pre-trained BERT sentence embedding model extracts more information from sentences yet has a smaller file size than the pre-trained fastText model. Our best model achieves an F1-score of 0.829 on the Ukara dataset, higher than the previous models.
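The setup described in the abstract, a single-hidden-layer classifier over fixed-size sentence vectors evaluated by F1-score, can be illustrated with a minimal sketch. This is not the author's implementation: the random vectors below only stand in for BERT sentence embeddings, and the binary labels, hidden size, tanh activation, learning rate, and all function names are assumptions chosen for illustration. The F1 here is the plain binary F1 for the positive class; the abstract does not say which averaging the paper uses.

```python
import numpy as np

def f1_score(y_true, y_pred):
    """Binary F1: harmonic mean of precision and recall for class 1."""
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    tp = int(np.sum((y_true == 1) & (y_pred == 1)))
    fp = int(np.sum((y_true == 0) & (y_pred == 1)))
    fn = int(np.sum((y_true == 1) & (y_pred == 0)))
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

def make_toy_embeddings(n=200, dim=16, seed=0):
    """Synthetic stand-in for sentence embeddings (assumption, not real BERT output)."""
    rng = np.random.default_rng(seed)
    y = rng.integers(0, 2, size=n)
    # Shift each class mean in opposite directions so the toy task is learnable.
    X = rng.normal(size=(n, dim)) + 1.5 * (2 * y - 1)[:, None]
    return X, y

def forward(X, params):
    """One hidden layer (tanh) followed by a sigmoid output unit."""
    W1, b1, W2, b2 = params
    h = np.tanh(X @ W1 + b1)
    logits = h @ W2 + b2
    p = 1.0 / (1.0 + np.exp(-logits))
    return h, p

def train(X, y, hidden=8, lr=0.5, steps=300, seed=0):
    """Full-batch gradient descent on mean binary cross-entropy."""
    rng = np.random.default_rng(seed)
    n, dim = X.shape
    W1 = 0.1 * rng.normal(size=(dim, hidden))
    b1 = np.zeros(hidden)
    W2 = 0.1 * rng.normal(size=hidden)
    b2 = 0.0
    for _ in range(steps):
        h, p = forward(X, (W1, b1, W2, b2))
        d_logits = (p - y) / n                          # dLoss/dlogits
        d_h = np.outer(d_logits, W2) * (1.0 - h ** 2)   # backprop through tanh
        W2 -= lr * (h.T @ d_logits)
        b2 -= lr * d_logits.sum()
        W1 -= lr * (X.T @ d_h)
        b1 -= lr * d_h.sum(axis=0)
    return W1, b1, W2, b2

X, y = make_toy_embeddings()
params = train(X, y)
_, p = forward(X, params)
pred = (p >= 0.5).astype(int)
score = f1_score(y, pred)
print(f"training F1 on toy data: {score:.3f}")
```

In a real pipeline the rows of `X` would come from a pre-trained sentence embedding model and the score would be measured on held-out answers rather than on the training data.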

Bibliographic details

Source: Kinetik, 2021, Issue 1, 8 pages
Author: Rian Adam Rajagede
Original format: PDF
Keywords: Automatic Essay Scoring; BERT Sentence Embedding; Neural Network
