Journal: International Journal of Data Mining and Bioinformatics
Statistical analysis for aggregated count data in genetic association studies

Abstract

In studies of smoking behaviour, cigarette counts per day (CPD) are typically reported in aggregated form, such as 0, one pack, two packs, and so on. Analysing such count data is challenging because of reporting bias and the difficulty of identifying an appropriate distribution. In this study, we set out to identify genetic variants, such as single nucleotide polymorphisms (SNPs), that are associated with aggregated count data such as CPD. We first reviewed existing approaches, in which the aggregated count is the dependent variable and the SNP is an ordinal independent variable. We then considered a calibration model that reverses these roles: the SNP is the ordinal dependent variable and the aggregated count is the independent variable. Because the count enters only as a covariate, this calibration approach is robust to the distributional assumptions that would otherwise be needed for the count data. We applied the robust calibration approach to CPD data on 4183 male samples from the Korean Association Resource project. In simulation studies, we compared the performance of the proposed method with that of competing approaches.
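To make the calibration idea concrete, here is a minimal sketch of one way to treat the SNP genotype (0, 1 or 2 copies of the minor allele) as an ordinal outcome with aggregated CPD as the covariate. The abstract does not state which ordinal model the authors use, so the cumulative-logit (proportional-odds) form below is an assumption, and the function names, cutpoints, and toy records are all hypothetical.

```python
import math


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def genotype_probs(packs, cutpoints, beta):
    """Return [P(G=0), P(G=1), P(G=2)] given packs smoked per day.

    Assumed model (not taken from the paper):
        logit P(G <= j | packs) = cutpoints[j] - beta * packs,
    with cutpoints[0] < cutpoints[1].
    """
    cum = [sigmoid(c - beta * packs) for c in cutpoints] + [1.0]
    return [cum[0]] + [cum[j] - cum[j - 1] for j in range(1, len(cum))]


def log_likelihood(data, cutpoints, beta):
    """Sum of log P(G = g | packs) over (packs, genotype) pairs."""
    return sum(math.log(genotype_probs(x, cutpoints, beta)[g]) for x, g in data)


# Made-up illustrative records: (packs per day, minor-allele count).
toy = [(0, 0), (0, 0), (0, 1), (1, 0), (1, 1), (1, 1), (2, 1), (2, 2), (2, 2)]
cut = (0.0, 2.0)  # hypothetical cutpoints, fixed rather than estimated

ll_null = log_likelihood(toy, cut, beta=0.0)  # no association
ll_alt = log_likelihood(toy, cut, beta=1.0)   # genotype shifts upward with CPD
print(ll_null, ll_alt)
```

Only the genotype is modelled here, so no distribution has to be chosen for the pack counts. A real analysis would estimate the cutpoints and `beta` by maximum likelihood and test whether `beta` is zero.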
