Using machine learning models to classify stroke risk level based on national screening data ^*

机译：使用机器学习模型基于国家筛查数据^{* 对中风风险等级进行分类}

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

With the character of high incidence, high prevalence and high mortality, stroke has brought a heavy burden to families and society in China. In 2009, the Ministry of Health of China launched the China national stroke screening and intervention program, which screens stroke risk factors and conducts high-risk population interventions for people aged over 40 years old all over China. In this program, stroke risk factors include hypertension, diabetes, dyslipidemia, atrial fibrillation, smoking, lack of exercise, apparently overweight or obese and family history of stroke. People with more than two risk factors or with a history of stroke or transient ischemic attack (TIA) are considered as high-risk. However, it is impossible for this criterion to classify stroke risk level for people with "unknown" values in the fields of risk factors. The missing of stroke risk levels results in reduced efficiency of stroke interventions and inaccuracies in the statistical results at the national level. In this paper, firstly, we construct the training set and test set and process the imbalanced training set based on oversampling and undersampling method. Then, we develop logistic regression model, decision tree model, neural network model and random forest model for stroke risk classification, and evaluate these models based on the recall and precision. The results show that the model based on random forest achieves best performance considering recall and precision. The models constructed in this paper can improve the screening efficiency and avoid unnecessary rescreening and intervention expenditures.

机译：中风具有高发病率，高患病率和高死亡率的特点，给中国家庭和社会带来沉重负担。 2009年，中国卫生部启动了“中国中风筛查和干预计划”，该计划旨在筛查中风的危险因素，并对全国40岁以上的人群进行高危人群干预。在该计划中，中风的危险因素包括高血压，糖尿病，血脂异常，房颤，吸烟，缺乏运动，明显超重或肥胖以及中风家族史。具有两个以上危险因素或具有中风或短暂性脑缺血发作（TIA）历史的人被视为高危人群。但是，对于在风险因素领域中具有“未知”值的人，此标准不可能对中风风险水平进行分类。中风风险水平的缺失导致中风干预措施的效率降低，并且国家一级的统计结果不准确。本文首先建立训练集和测试集，并基于过采样和欠采样方法处理不平衡训练集。然后，我们开发了用于中风风险分类的逻辑回归模型，决策树模型，神经网络模型和随机森林模型，并基于召回率和精度对这些模型进行了评估。结果表明，基于召回率和精度，基于随机森林的模型取得了最佳性能。本文构建的模型可以提高筛查效率，避免不必要的重新筛查和干预支出。

著录项

来源
《Annual International Conference of the IEEE Engineering in Medicine and Biology Society》|2019年|1386-1390|共5页
会议地点
作者
Xuemeng Li; Di Bian; JingHui Yu; HuaJian Mao; Mei Li; Dongsheng Zhao;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
History; Decision trees; Stroke (medical condition); Biological system modeling; Neural networks; Sociology; Statistics;

机译：历史;决策树;中风（医学状况）;生物系统建模;神经网络;社会学;统计;

相似文献

外文文献
中文文献
专利

1. HYBRID MODEL FOR TWITTER DATA SENTIMENT ANALYSIS BASED ON ENSEMBLE OF DICTIONARY BASED CLASSIFIER AND STACKED MACHINE LEARNING CLASSIFIERS-SVM, KNN AND C5.0 [J] . SANGEETA RANI, NASIB SINGH GILL Journal of Theoretical and Applied Information Technology . 2020,第4期

机译：基于字典的分类器和堆叠机学习分类分类的基于词典的Twitter数据情绪分析混合模型 - SVM，KNN和C5.0
2. Comparative of Machine Learning Algorithms and Datasets to Classify Natural Coverage in the Cajas National Park (Ecuador) Based on GEOBIA Approach [J] . Diego Pacheco Prado, Luis ángel Ruiz Proceedings . 2019,第1期

机译：机器学习算法和数据集的比较基于Geobia方法对Cajas国家公园（厄瓜多尔）的自然报道进行分类
3. Evaluation of Machine Learning Models for Classifying Upper Extremity Exercises Using Inertial Measurement Unit-Based Kinematic Data [J] . Hua Andrew, Chaudhari Pratik, Johnson Nicole, Biomedical and Health Informatics, IEEE Journal of . 2020,第9期

机译：基于惯性测量单元的运动学数据进行分类的机器学习模型的评估
4. Using machine learning models to classify stroke risk level based on national screening data * [C] . Xuemeng Li, Di Bian, JingHui Yu, Annual International Conference of the IEEE Engineering in Medicine and Biology Society . 2019

机译：使用机器学习模型基于国家筛选数据 * 对笔划风险级别分类
5. Applying Supervised Machine Learning Models to Classify Day Traders Using the Traders' Daily Trading Activity Data and the U.S. Stock Market Indices Data [D] . Sharara, Nasser Ateya 2018

机译：应用监督机器学习模型以使用交易者的每日交易活动数据和美国股票市场指数数据对日间交易者进行分类
6. Using machine learning models to improve stroke risk level classification methods of China national stroke screening [O] . Xuemeng Li, Di Bian, Jinghui Yu, 2019

机译：使用机器学习模型改进中国国家卒中筛查的卒中风险等级分类方法
7. Using machine learning models to improve stroke risk level classification methods of China national stroke screening [O] . Xuemeng Li, Di Bian, Jinghui Yu, 2019

机译：采用机器学习模型提高中国中风筛查的行程风险等级分类方法
8. Novel Machine Learning Classifier Based on a Qualia Modeling Agent (QMA). [R] . Vaughan, S. L. 2016

机译：基于Qualia建模代理（Qma）的新型机器学习分类器。

Using machine learning models to classify stroke risk level based on national screening data *

摘要

著录项

相似文献

相关主题

期刊订阅

Using machine learning models to classify stroke risk level based on national screening data ^*