...
首页> 外文期刊>Journal of computational biology: A journal of computational molecular cell biology >Machine Learning-Based Method for Obesity Risk Evaluation Using Single-Nucleotide Polymorphisms Derived from Next-Generation Sequencing
【24h】

Machine Learning-Based Method for Obesity Risk Evaluation Using Single-Nucleotide Polymorphisms Derived from Next-Generation Sequencing

机译:基于机器学习的肥胖风险评估方法,使用下一代测序衍生的单核苷酸多态性

获取原文
获取原文并翻译 | 示例
           

摘要

Obesity is a major risk factor for many metabolic diseases. To understand the genetic characteristics of obese individuals, single-nucleotide polymorphisms (SNPs) derived from next-generation sequencing (NGS) provide comprehensive insight into genome-wide genetic investigation. However, interpretation of these SNP data for clinical application is difficult given the high complexity of NGS data. Hence, in this study, obesity risk prediction models based on SNPs were designed using machine learning (ML) methods, namely support vector machine (SVM), k-nearest neighbor, and decision tree (DT). This investigation obtained clinicopathological features, including 130 SNPs, sex, and age, from 139 eligible individuals. Various feature selection methods, such as stepwise multivariate linear regression (MLR), DT, and genetic algorithms, were applied to select informative features for generating obesity prediction models. Multivariate logistic regression was used to evaluate the importance of the selected features. The models trained from various features evaluated their predictive performances based on fivefold cross-validation. Three measures, namely accuracy, sensitivity, and specificity, were used to examine and compare the predictive power among various models. To design obesity prediction models using ML methods, nine SNPs, including rs10501087, rs17700144, rs2287019, rs534870, rs660339, rs7081678, rs718314, rs9816226, and rs984222, were selected based on stepwise MLR. In evaluation of model performance, the SVM model significantly outperformed other classifiers based on the same training features. The SVM model exhibits 70.77% accuracy, 80.09% sensitivity, and 63.02% specificity. This investigation has demonstrated that the selected SNPs were effective in the detection of obesity risk. Additionally, the ML-based method provides a feasible mean for conducting preliminary analyses of genetic characteristics of obesity.
机译:肥胖是许多代谢疾病的主要危险因素。要了解肥胖个体的遗传特征,来自下一代测序(NGS)的单核苷酸多态性(SNP)可以综合深入了解基因组族遗传调查。然而,鉴于NGS数据的高复杂性,对临床应用的这些SNP数据的解释是困难的。因此,在本研究中,使用机器学习(ML)方法设计了基于SNP的肥胖风险预测模型,即支持向量机(SVM),K最近邻居和决策树(DT)。该调查获得了139名符合条件的临床病理特征,包括130名SNP,性别和年龄。应用各种特征选择方法,例如逐步多变量线性回归(MLR),DT和遗传算法,用于选择用于产生肥胖预测模型的信息特征。多变量逻辑回归用于评估所选功能的重要性。从各种功能训练的模型基于五倍交叉验证评估了它们的预测性能。三项措施,即精度,敏感性和特异性,用于检查各种模型之间的预测力。根据ML方法设计肥胖预测模型,基于逐步的MLR选择九个SNP,包括RS10501087,RS17700144,RS2287019,RS5348,RS718314,RS981626,RS718314,RS981626和RS984222。在评估模型性能方面,SVM模型基于相同的训练功能显着优于其他分类器。 SVM模型的精度为70.77%,灵敏度80.09%和63.02%。该研究表明,所选的SNPS在检测肥胖风险中有效。另外,基于ML的方法提供了用于进行肥胖遗传特征的初步分析的可行性意义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号