首页> 外文会议>International Seminar on Application for Technology of Information and Communication >A Comparison of Data Mining Techniques for Suicide Attempt Characteristics Mapping and Prediction
【24h】

A Comparison of Data Mining Techniques for Suicide Attempt Characteristics Mapping and Prediction

机译:自杀试图特征映射和预测数据挖掘技术的比较

获取原文

摘要

According to the report by Khon Kaen Rajanagarin-dra Psychiatric hospital, there are many factors such as poverty, loss, disappointment, alcohol addiction, addiction, chronic physical illness, depression, economic problems and major life changes that cause suicide attempts. However, there are some unsuccessful suicides. These individuals might try to repeatedly commit suicide attempts. In this paper, the aims of this research are to compare ID3, C4.5 and na?ve Bayes and predict the characteristics of individuals who have suicidal ideation to repeatedly commit suicide attempts by using data mining techniques. After data preprocessing, three data mining techniques were selected to compare the efficiency of the single of classification models and the ensemble models. The researchers also adopted synthetic minority over-sampling technique (SMOTE) to balance the suicidal ideation of reattempt and no attempt classes. The researchers used 57 attributes from self-harm surveillance report (RP.506S) of Khon Kaen Rajanagarindra Psychiatric hospital. The experimental results of the single method showed that C4.5 is the best model with the highest percentage of correctly classified instances rate of 90.69%, while the ensemble model, voting method showed the best results of 91.26. We found the number of characteristics from C4.5 is 474 patterns. There are people, who have suicidal ideation over 500 in each case is 42 patterns. In 42 patterns, there are 22 patterns that more than 90% accuracy.
机译:根据Khon Kaen Rajanagarin-Dra精神病院的报告,有许多因素如贫困,失望,酒精成瘾,成瘾,慢性身体疾病,抑郁,经济问题和主要生活变化,导致自杀。但是,有一些不成功的自杀。这些人可能会尝试反复犯下自杀的尝试。在本文中,这项研究的目的是比较ID3,C4.5和朴素贝叶斯和预测谁有自杀意念,通过使用数据挖掘技术多次自杀企图的个人特点。在数据预处理之后,选择了三种数据挖掘技术以比较单个分类模型和集合模型的效率。研究人员还采用了合成少数群体过采样技术(SMOTE),以平衡重新尝试的自杀意图,没有尝试课程。研究人员使用了57个属性来自Khon Kaen Rajanagarindra精神病院的自我伤害监督报告(RP.506S)。单一方法的实验结果表明,C4.5是最高百分比的最高型号,正确分类的实例率为90.69%,而合奏模型,投票方式显示出91.26的最佳结果。我们发现来自C4.5的特征数量是474模式。有些人在每种情况下有500多种的人,是42种模式。在42个模式中,有22种模式,精度超过90%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号