Term Frequency-Inverse Document Frequency Answer Categorization with Support Vector Machine on Automatic Short Essay Grading System with Latent Semantic Analysis for Japanese Language

Abstract

This paper presents research aimed at increasing the accuracy of an automatic short essay grading system for the Japanese language. Japanese short answers are first processed with a supervised machine learning algorithm, a Support Vector Machine (SVM), before entering the system, which uses Latent Semantic Analysis (LSA). The SVM classifies the short answers by topic in order to minimize errors in assessing the essay. TF-IDF weighting is applied to every keyword in a sentence, and the weighted result serves as the input to the SVM. The result is then processed with LSA, which uses Singular Value Decomposition (SVD) as its main process and the Frobenius norm as the final calculation on the SVD result. Using a linear kernel in the SVM, the accuracy obtained in classifying the topics of short answers written in Japanese is 96.36%, with penalty values from 10.0 to 100.0 and a training portion of 0.5. The accuracy obtained from LSA averages 87.15% when the input is a term-document matrix (TDM) holding word occurrence frequencies.
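
The two stages named in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes scikit-learn and NumPy, the toy answers and topic labels are hypothetical, the text is assumed to be already tokenized (real Japanese input would first need a morphological tokenizer), and the final scoring rule (the ratio of the two Frobenius norms) is an assumption, since the abstract does not state how the norm is turned into a grade.

```python
import numpy as np
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

# Hypothetical toy data: answers already split into space-separated tokens.
answers = [
    "猫 は 動物 です", "犬 は 動物 です", "猫 と 犬 は 動物", "動物 は 猫 や 犬",
    "東京 は 都市 です", "大阪 は 都市 です", "東京 と 大阪 は 都市", "都市 は 東京 や 大阪",
]
topics = ["animal"] * 4 + ["city"] * 4

TOKEN = r"(?u)\S+"  # treat every whitespace-delimited token as a term

# Stage 1: TF-IDF features feeding a linear-kernel SVM topic classifier.
train_x, test_x, train_y, test_y = train_test_split(
    answers, topics, test_size=0.5, stratify=topics, random_state=0
)  # training portion of 0.5, as in the abstract
tfidf = TfidfVectorizer(token_pattern=TOKEN)
clf = SVC(kernel="linear", C=10.0)  # C is the penalty value (abstract: 10.0 to 100.0)
clf.fit(tfidf.fit_transform(train_x), train_y)
predicted = clf.predict(tfidf.transform(test_x))
accuracy = float(np.mean(predicted == np.array(test_y)))


# Stage 2: LSA-style comparison of a student answer with a reference answer.
def term_document_matrices(reference, student):
    """Return term-by-sentence count matrices over a shared vocabulary."""
    counter = CountVectorizer(token_pattern=TOKEN).fit(reference + student)
    ref_tdm = counter.transform(reference).T.toarray().astype(float)
    stu_tdm = counter.transform(student).T.toarray().astype(float)
    return ref_tdm, stu_tdm


def truncated_frobenius(tdm, k):
    """Frobenius norm of the diagonal matrix of the k largest singular values."""
    singular_values = np.linalg.svd(tdm, compute_uv=False)
    return float(np.linalg.norm(np.diag(singular_values[:k]), "fro"))


def lsa_score(reference, student, k=2):
    """Assumed scoring rule: smaller norm divided by larger norm, in percent."""
    ref_tdm, stu_tdm = term_document_matrices(reference, student)
    a = truncated_frobenius(ref_tdm, k)
    b = truncated_frobenius(stu_tdm, k)
    return 0.0 if max(a, b) == 0.0 else 100.0 * min(a, b) / max(a, b)


reference_answer = ["猫 は 動物 です", "猫 は 魚 を 食べる"]
student_answer = ["猫 は 動物 です", "猫 は 魚 が 好き"]
print("topic accuracy:", accuracy)
print("LSA score:", lsa_score(reference_answer, student_answer))
```

In this sketch the SVM only decides which topic an answer belongs to; the grade itself would come from the LSA stage, which compares the answer against a reference for that topic. A norm ratio compares only the magnitude of the two matrices, so a real system would likely need a more careful comparison than the one assumed here.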

Bibliographic details

Source
International Conference on Electrical Engineering and Computer Science | 2019 | pp. 293-298 | 6 pages
Authors
Anak Agung Putri Ratna; Aaliyah Kaltsum; Lea Santiar; Hanifah Khairunissa; Ihsan Ibrahim; Prima Dewi Purnamasari
Original format: PDF
Keywords
natural language processing; pattern classification; singular value decomposition; supervised learning; support vector machines; text analysis

Similar documents

1. Automatic Essay Grading System for Short Answers in English Language [J]. Ali Muftah Ben Omran, Mohd Juzaiddin Ab Aziz. Journal of computer sciences. 2013, No. 10

2. Automatic Essay Grading System for Short Answers in English Language | Science Publications [J]. Ali Muftah Ben Omran, Mohd Juzaiddin Ab Aziz. Journal of computer sciences. 2013, No. 10

3. Automatic Article Summary with the Term Frequency-Inverse Document Frequency Algorithm for Information on Elderly Health [J]. Juli Sulaksono, Risky Aswi Ramadhani, Ratih Kumalasari Niswatin. Journal of computational and theoretical nanoscience. 2020, No. 2a3

4. Term Frequency-Inverse Document Frequency Answer Categorization with Support Vector Machine on Automatic Short Essay Grading System with Latent Semantic Analysis for Japanese Language [C]. Anak Agung Putri Ratna, Aaliyah Kaltsum, Lea Santiar. International Conference on Electrical Engineering and Computer Science. 2019

5. A machine-aided approach to intelligent index generation: Using natural language processing and latent semantic analysis to determine the contexts and relationships among words in a corpus [D]. Lukon, Shelly Candita. 2006

6. Using Latent Semantic Analysis to Score Short Answer Constructed Responses: Automated Scoring of the Consequences Test [O]. Noelle LaVoie, James Parker, Peter J. Legree. 2020

7. An Enhanced Hybrid Feature Selection Technique Using Term Frequency-Inverse Document Frequency and Support Vector Machine-Recursive Feature Elimination for Sentiment Classification [O]. Nur Syafiqah Mohd Nafis, Suryanti Awang. 2021
