首页> 美国卫生研究院文献>BMC Proceedings >Performance of random forests and logic regression methods using mini-exome sequence data
【2h】

Performance of random forests and logic regression methods using mini-exome sequence data

机译:使用小型外显子组序列数据的随机森林性能和逻辑回归方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Machine learning approaches are an attractive option for analyzing large-scale data to detect genetic variants that contribute to variation of a quantitative trait, without requiring specific distributional assumptions. We evaluate two machine learning methods, random forests and logic regression, and compare them to standard simple univariate linear regression, using the Genetic Analysis Workshop 17 mini-exome data. We also apply these methods after collapsing multiple rare variants within genes and within gene pathways. Linear regression and the random forest method performed better when rare variants were collapsed based on genes or gene pathways than when each variant was analyzed separately. Logic regression performed better when rare variants were collapsed based on genes rather than on pathways.
机译:机器学习方法是分析大规模数据以检测有助于定量性状变异的遗传变异的一种有吸引力的选择,而无需特定的分布假设。我们评估了两种机器学习方法,即随机森林和逻辑回归,并将它们与标准简单单变量线性回归进行比较,并使用了遗传分析研讨会17的小型外显子组数据。在基因和基因途径内折叠多个罕见变体后,我们也应用这些方法。当基于基因或基因途径折叠稀有变体时,线性回归和随机森林方法的效果要比分别分析每个变体时更好。当基于基因而非途径折叠罕见变体时,逻辑回归表现更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号