首页> 外文期刊>Human Heredity >A New System Identification Approach to Identify Genetic Variants in Sequencing Studies for a Binary Phenotype
【24h】

A New System Identification Approach to Identify Genetic Variants in Sequencing Studies for a Binary Phenotype

机译:在二元表型测序研究中鉴定遗传变异的新系统鉴定方法

获取原文
获取原文并翻译 | 示例
           

摘要

We propose in this paper a set-valued (SV) system model, which is a generalized form of logistic (LG) and Probit (Probit) regression, to be considered as a method for discovering genetic variants, especially rare genetic variants in next-generation sequencing studies, for a binary phenotype. We propose a new SV system identification method to estimate all underlying key system parameters for the Probit model and compare it with the LG model in the setting of genetic association studies. Across an extensive series of simulation studies, the Probit method maintained type I error control and had similar or greater power than the LG method, which is robust to different distributions of noise: logistic, normal, or t distributions. Additionally, the Probit association parameter estimate was 2.7-46.8-fold less variable than the LG log-odds ratio association parameter estimate. Less variability in the association parameter estimate translates to greater power and robustness across the spectrum of minor allele frequencies (MAFs), and these advantages are the most pronounced for rare variants. For instance, in a simulation that generated data from an additive logistic model with an odds ratio of 7.4 for a rare single nucleotide polymorphism with a MAF of 0.005 and a sample size of 2,300, the Probit method had 60% power whereas the LG method had 25% power at the α = 10-6 level. Consistent with these simulation results, the set of variants identified by the LG method was a subset of those identified by the Probit method in two example analyses. Thus, we suggest the Probit method may be a competitive alternative to the LG method in genetic association studies such as candidate gene, genome-wide, or next-generation sequencing studies for a binary phenotype.
机译:我们在本文中提出了一种集值(SV)系统模型,该模型是logistic(LG)和Probit(Probit)回归的广义形式,被认为是发现遗传变异的一种方法,尤其是在下一个发现中的稀有遗传变异。二元表型的遗传测序研究。我们提出了一种新的SV系统识别方法,以估算Probit模型的所有基本关键系统参数,并将其与LG模型进行遗传关联研究。在一系列广泛的模拟研究中,Probit方法保持了I型错误控制,并且具有与LG方法相似或更高的功效,该方法对于不同的噪声分布(逻辑分布,正态分布或t分布)具有鲁棒性。此外,Probit关联参数估计的变量比LG对数比比率关联参数估计的变量少2.7-46.8倍。关联参数估计的较小可变性可在较小等位基因频率(MAF)的频谱上转化为更大的功率和鲁棒性,而这些优势在稀有变体中最为明显。例如,在一个模拟中,该数据来自一个加法逻辑模型,该模型的奇数比为7.4,用于MAF为0.005,样本大小为2,300的罕见单核苷酸多态性,Probit方法的功效为60%,而LG方法的功效为60%。在α= 10-6时,功率为25%。与这些仿真结果一致,在两个示例分析中,通过LG方法识别的变体集是通过Probit方法识别的变体的子集。因此,我们建议在遗传关联研究(例如候选基因,全基因组或二元表型的下一代测序研究)中,Probit方法可能是LG方法的竞争替代方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号