...
首页> 外文期刊>Translational psychiatry. >Improved metabolomic data-based prediction of depressive symptoms using nonlinear machine learning with feature selection
【24h】

Improved metabolomic data-based prediction of depressive symptoms using nonlinear machine learning with feature selection

机译:使用非线性机器学习与特征选择改善基于代谢数据的抑郁症状预测

获取原文

摘要

To solve major limitations in algorithms for the metabolite-based prediction of psychiatric phenotypes, a novel prediction model for depressive symptoms based on nonlinear feature selection machine learning, the Hilbert–Schmidt independence criterion least absolute shrinkage and selection operator (HSIC Lasso) algorithm, was developed and applied to a metabolomic dataset with the largest sample size to date. In total, 897 population-based subjects were recruited from the communities affected by the Great East Japan Earthquake; 306 metabolite features (37 metabolites identified by nuclear magnetic resonance measurements and 269 characterized metabolites based on the intensities from mass spectrometry) were utilized to build prediction models for depressive symptoms as evaluated by the Center for Epidemiologic Studies-Depression Scale (CES-D). The nested fivefold cross-validation was used for developing and evaluating the prediction models. The HSIC Lasso-based prediction model showed better predictive power than the other prediction models, including Lasso, support vector machine, partial least squares, random forest, and neural network. l-leucine, 3-hydroxyisobutyrate, and gamma-linolenyl carnitine frequently contributed to the prediction. We have demonstrated that the HSIC Lasso-based prediction model integrating nonlinear feature selection showed improved predictive power for depressive symptoms based on metabolome data as well as on risk metabolites based on nonlinear statistics in the Japanese population. Further studies should use HSIC Lasso-based prediction models with different ethnicities to investigate the generality of each risk metabolite for predicting depressive symptoms.
机译:为了解决精神病毒表型的代谢物的预测算法的主要限制,基于非线性特征选择机学习的抑郁症状的新预测模型,希尔伯特 - 施密特独立性最低绝对收缩和选择运算符(HSIC套索)算法开发并应用于代谢物数据集,其样本大小最大。总共有897名人口受试者从大东日本地震影响的社区招募; 306代谢物特征(通过核磁共振测量的37种代谢物,并基于质谱法的强度鉴定的代谢物)用于构建由流行病学研究中心评估的抑郁症状的预测模型(CES-D)。嵌套的五倍交叉验证用于开发和评估预测模型。基于HSIC套索的预测模型显示出比其他预测模型更好的预测力,包括套索,支持向量机,部分最小二乘,随机林和神经网络。 L-亮氨酸,3-羟基异丁酸和γ-亚麻糖苷肉碱经常导致预测。我们已经证明,基于非线性特征选择的基于HSIC套索的预测模型显示了基于代谢数据的抑郁症状的预测力,以及基于日本人群非线性统计的风险代谢物。进一步的研究应该使用具有不同种族的HSIC套索的预测模型来研究每个风险代谢物的一般性,以预测抑郁症状。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号