首页> 美国卫生研究院文献>Springer Open Choice >Prediction of skin color tanning and freckling from DNA in Polish population: linear regression random forest and neural network approaches
【2h】

Prediction of skin color tanning and freckling from DNA in Polish population: linear regression random forest and neural network approaches

机译:从波兰人口的DNA预测肤色晒黑和雀斑:线性回归随机森林和神经网络方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Predicting phenotypes from DNA has recently become extensively studied field in forensic research and is referred to as Forensic DNA Phenotyping. Systems based on single nucleotide polymorphisms for accurate prediction of iris, hair and skin color in global population, independent of bio-geographical ancestry, have recently been introduced. Here, we analyzed 14 SNPs for distinct skin pigmentation traits in a homogeneous cohort of 222 Polish subjects. We compared three different algorithms: General Linear Model based on logistic regression, Random Forest and Neural Network in 18 developed prediction models. We demonstrate Random Forest to be the most accurate algorithm for 3- and 4-category estimations (total of 58.3% correct calls for skin color prediction, 47.2% for tanning prediction, 50% for freckling prediction). Binomial Logistic Regression was the best approach in 2-category estimations (total of 69.4% correct calls, AUC = 0.673 for tanning prediction; total of 52.8% correct calls, AUC = 0.537 for freckling prediction). Our study confirms the association of rs12913832 (HERC2) with all three skin pigmentation traits, but also variants associated solely with certain pigmentation traits, namely rs6058017 and rs4911414 (ASIP) with skin sensitivity to sun and tanning abilities, rs12203592 (IRF4) with freckling and rs4778241 and rs4778138 (OCA2) with skin color and tanning. Finally, we assessed significant differences in allele frequencies in comparison with CEU data and our study provides a starting point for the development of prediction models for homogeneous populations with less internal differentiation than in the global predictive testing.Electronic supplementary materialThe online version of this article (10.1007/s00439-019-02012-w) contains supplementary material, which is available to authorized users.
机译:从DNA预测表型最近已成为法医学研究的广泛研究领域,被称为法医学DNA表型分析。最近已经引入了基于单核苷酸多态性的系统,用于准确预测全球人口中的虹膜,头发和肤色,而与生物地理谱系无关。在这里,我们分析了222名波兰受试者的同类队列中14种SNP的独特皮肤色素沉着特征。我们在18种已开发的预测模型中比较了三种不同的算法:基于逻辑回归的通用线性模型,随机森林和神经网络。我们证明随机森林是3类和4类估计最准确的算法(正确的总肤色预测为58.3%,晒黑预测为47.2%,雀斑预测为50%)。二项式Logistic回归是2类估计中的最佳方法(总计69.4%正确检出,鞣制预测为AUC = 0.673;总计52.8%正确检出,雀斑预测为AUC = 0.537)。我们的研究证实了rs12913832(HERC2)与所有三个皮肤色素沉着性状的关联,但也证实了仅与某些色素沉着性状相关的变体,即具有皮肤对阳光和晒黑能力敏感的rs6058017和rs4911414(ASIP),具有雀斑和黑斑的rs12203592(IRF4)。 rs4778241和rs4778138(OCA2),肤色和晒黑。最后,我们评估了与CEU数据相比等位基因频率的显着差异,并且我们的研究为开发具有比全球预测测试更少内部差异的同质人群预测模型提供了起点。 10.1007 / s00439-019-02012-w)包含补充材料,授权用户可以使用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号