...
首页> 外文期刊>JMLR: Workshop and Conference Proceedings >ScanMap Supervised Confounding Aware Non-negative Matrix Factorization for Polygenic Risk Modeling
【24h】

ScanMap Supervised Confounding Aware Non-negative Matrix Factorization for Polygenic Risk Modeling

机译:ScanMap监督多基因风险建模的混淆意识非负矩阵分解

获取原文
           

摘要

Molecular mechanisms are important to inform targeted intervention and are often encoded in gene sets or pathways. Existing machine learning approaches often face challenges in simultaneously reducing the high dimensionality and learning effective features that are discriminative in predicting the disease types with the usual presence of confounding variables. We aim to improve accuracy and interpretability of prediction models by introducing Supervised Confounding Aware Non-negative Matrix Factorization for Polygenic Risk Modeling (ScanMap) for genetic studies. ScanMap selects informative groups of genes that embody multiple interacting molecular functions by using a supervised model that integrates both groups of genes and confounding variables in predicting disease type and status. The learned groups of genes reflect interacting molecular mechanisms, which are suitable features for polygenic risk modeling. These learned features are then used in training a softmax classifier for disease type and status prediction. We evaluated ScanMap against multiple state-of-the-art unsupervised and supervised matrix factorization models using large scale NGS datasets. ScanMap outperformed all comparison models significantly (p < 0:05). Feature analysis was performed to illuminate the insights and benefits of gene groups learned by ScanMap in disease risk prediction.
机译:分子机制对于通知有针对性干预的重要性是重要的,并且通常以基因组或途径编码。现有的机器学习方法经常面临挑战,同时减少具有判别预测疾病类型的高维性和学习有效特征,这些特征在于通常存在混淆变量的常规存在。我们旨在通过引入遗传研究的多基因风险建模(ScanMap)来提高预测模型的准确性和可解释性。 ScanMap选择通过使用将两组基因和混淆变量集成在预测疾病类型和地位中的监督模型来选择多种相互作用分子功能的信息组。学习基因组反映了相互作用的分子机制,这是多种基因风险建模的合适特征。然后,这些学习的功能将用于训练Softmax分类器以进行疾病类型和状态预测。我们使用大规模NGS数据集评估了针对多个最先进的无监督和监督矩阵分解模型的ScanMap。 ScanMap显着优于所有比较模型(P <0:05)。进行特征分析,以照亮Scanmap在疾病风险预测中学到的基因团体的见解和益处。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号