首页> 外文OA文献 >Brief review of regression‐based and machine learning methods in genetic epidemiology: the Genetic Analysis Workshop 17 experience
【2h】

Brief review of regression‐based and machine learning methods in genetic epidemiology: the Genetic Analysis Workshop 17 experience

机译:遗传流行病学中基于回归和机器学习方法的简要回顾:遗传分析研讨会17的经验

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Genetics Analysis Workshop 17 provided common and rare genetic variants from exome sequencing data and simulated binary and quantitative traits in 200 replicates. We provide a brief review of the machine learning and regression‐based methods used in the analyses of these data. Several regression and machine learning methods were used to address different problems inherent in the analyses of these data, which are high‐dimension, low‐sample‐size data typical of many genetic association studies. Unsupervised methods, such as cluster analysis, were used for data segmentation and, subset selection. Supervised learning methods, which include regression‐based methods (e.g., generalized linear models, logic regression, and regularized regression) and tree‐based methods (e.g., decision trees and random forests), were used for variable selection (selecting genetic and clinical features most associated or predictive of outcome) and prediction (developing models using common and rare genetic variants to accurately predict outcome), with the outcome being case‐control status or quantitative trait value. We include a discussion of cross‐validation for model selection and assessment, and a description of available software resources for these methods. Genet. Epidemiol . 35:S5–S11, 2011. © 2011 Wiley Periodicals, Inc.
机译:遗传分析研讨会17提供了200个重复样本中外显子组测序数据以及模拟的二进制和定量性状的常见和稀有遗传变异。我们对这些数据的分析中使用的基于机器学习和回归的方法进行了简要回顾。几种回归和机器学习方法被用来解决这些数据分析中固有的不同问题,这些是许多遗传关联研究中典型的高维,低样本量数据。无监督方法(例如聚类分析)用于数据分割和子集选择。有监督的学习方法,包括基于回归的方法(例如广义线性模型,逻辑回归和正则回归)和基于树的方法(例如决策树和随机森林),用于变量选择(选择遗传和临床特征)和结果的最相关或预测)和预测(使用常见和罕见的遗传变异体开发模型以准确预测结果),结果为病例对照状态或定量特征值。我们讨论了模型选择和评估的交叉验证,并描述了这些方法的可用软件资源。基因流行病。 35:S5–S11,2011。©2011 Wiley Periodicals,Inc.

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号