Conference: EPIA Conference on Artificial Intelligence

A Data Mining Approach Applied to the High School National Examination: Analysis of Aspects of Candidates to Brazilian Universities

Abstract

Many countries use national-scale exams for admission to college courses, such as the Gaokao in China, the Scholastic Aptitude Test (SAT) and the American College Testing (ACT) in the United States of America, and the Yükseköğretime Geçiş Sınavı (YGS) in Turkey, among others. This paper examines microdata from the High School National Examination (ENEM) database from Brazil. The database has 8,627,367 records and 166 attributes, and all experiments were performed on the Spark architecture. The objective of this work is to examine the microdata of the ENEM database by applying data mining algorithms, creating an approach to handle big data and to predict the profile of those enrolled in ENEM. From the patterns found by the classification algorithms, it was possible to observe that family income, access to information, and the profession and academic history of the parents were directly related to the performance of the candidates. With a rule induction algorithm, it was possible to identify the patterns present in each of the regions of Brazil, such as the common characteristics of candidates who were approved and of those who were not, and essential factors such as disciplines and the particular characteristics of each region. This approach also enables the processing of large volumes of data in a simplified computational structure.