首页> 外文会议>International Seminar on Application for Technology of Information and Communication >A Comparison of Data Mining Techniques for Suicide Attempt Characteristics Mapping and Prediction
【24h】

A Comparison of Data Mining Techniques for Suicide Attempt Characteristics Mapping and Prediction

机译:自杀尝试特征映射与预测数据挖掘技术的比较

获取原文

摘要

According to the report by Khon Kaen Rajanagarin-dra Psychiatric hospital, there are many factors such as poverty, loss, disappointment, alcohol addiction, addiction, chronic physical illness, depression, economic problems and major life changes that cause suicide attempts. However, there are some unsuccessful suicides. These individuals might try to repeatedly commit suicide attempts. In this paper, the aims of this research are to compare ID3, C4.5 and naïve Bayes and predict the characteristics of individuals who have suicidal ideation to repeatedly commit suicide attempts by using data mining techniques. After data preprocessing, three data mining techniques were selected to compare the efficiency of the single of classification models and the ensemble models. The researchers also adopted synthetic minority over-sampling technique (SMOTE) to balance the suicidal ideation of reattempt and no attempt classes. The researchers used 57 attributes from self-harm surveillance report (RP.506S) of Khon Kaen Rajanagarindra Psychiatric hospital. The experimental results of the single method showed that C4.5 is the best model with the highest percentage of correctly classified instances rate of 90.69%, while the ensemble model, voting method showed the best results of 91.26. We found the number of characteristics from C4.5 is 474 patterns. There are people, who have suicidal ideation over 500 in each case is 42 patterns. In 42 patterns, there are 22 patterns that more than 90% accuracy.
机译:根据孔敬(Khon Kaen Rajanagarin-dra)精神病医院的报告,有许多因素,例如贫穷,损失,失望,酒精成瘾,成瘾,慢性身体疾病,抑郁症,经济问题和导致自杀未遂的重大生活变化。但是,有一些自杀未果。这些人可能会尝试反复自杀。在本文中,这项研究的目的是比较ID3,C4.5和朴素的贝叶斯,并预测具有自杀意向的人的特征,这些人会通过使用数据挖掘技术来反复尝试自杀。经过数据预处理后,选择了三种数据挖掘技术来比较单个分类模型和集成模型的效率。研究人员还采用了合成少数派过采样技术(SMOTE),以平衡重试和无尝试班的自杀念头。研究人员使用了孔敬拉惹那加林德拉精神病医院的自我伤害监测报告(RP.506S)中的57个属性。单一方法的实验结果表明,C4.5是最佳模型,正确分类实例的百分比最高,为90.69%,而集成模型,投票方法则显示最佳结果,为91.26。我们发现C4.5的特征数量为474个模式。有些人的自杀意念超过500种,每种情况下有42种模式。在42种样式中,有22种样式的准确性超过90%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号