首页> 外文期刊>Bioinformatics >MISS: a non-linear methodology based on mutual information for genetic association studies in both population and sib-pairs analysis
【24h】

MISS: a non-linear methodology based on mutual information for genetic association studies in both population and sib-pairs analysis

机译:MISS:一种基于相互信息的非线性方法,用于群体和同胞对分析中的遗传关联研究

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Motivation: Finding association between genetic variants and phenotypes related to disease has become an important vehicle for the study of complex disorders. In this context, multi-loci genetic association might unravel additional information when compared with single loci search. The main goal of this work is to propose a non-linear methodology based on information theory for finding combinatorial association between multi-SNPs and a given phenotype.Results: The proposed methodology, called MISS (mutual information statistical significance), has been integrated jointly with a feature selection algorithm and has been tested on a synthetic dataset with a controlled phenotype and in the particular case of the F7 gene. The MISS methodology has been contrasted with a multiple linear regression (MLR) method used for genetic association in both, a population-based study and a sib-pairs analysis and with the maximum entropy conditional probability modelling (MECPM) method, which searches for predictive multi-locus interactions. Several sets of SNPs within the F7 gene region have been found to show a significant correlation with the FVII levels in blood. The proposed multi-site approach unveils combinations of SNPs that explain more significant information of the phenotype than their individual polymorphisms. MISS is able to find more correlations between SNPs and the phenotype than MLR and MECPM. Most of the marked SNPs appear in the literature as functional variants with real effect on the protein FVII levels in blood.
机译:动机:寻找与疾病相关的遗传变异和表型之间的关联已成为研究复杂疾病的重要工具。在这种情况下,与单基因座搜索相比,多基因座遗传关联可能会揭示其他信息。这项工作的主要目的是提出一种基于信息论的非线性方法,用于寻找多个SNP与给定表型之间的组合关联。结果:所提出的方法被称为MISS(相互信息统计意义),已被整合在一起具有特征选择算法,并且已经在具有受控表型的合成数据集上进行了测试,并且在F7基因的特殊情况下也进行了测试。 MISS方法与基于群体的研究和同胞对分析中用于遗传关联的多元线性回归(MLR)方法以及用于搜索预测性的最大熵条件概率模型(MECPM)方法形成对比。多场所互动。已发现F7基因区域内的几组SNP与血液中的FVII水平显着相关。提议的多位点方法揭示了SNP的组合,这些组合比其各自的多态性能解释更多的表型信息。与MLR和MECPM相比,MISS能够发现SNP与表型之间的更多相关性。大多数标记的SNP在文献中以功能性变体形式出现,对血液中FVII蛋白水平有实际影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号