首页> 外文会议>IEEE International Conference on Bioinformatics and Biomedicine >Down syndrome prediction/screening model based on deep learning and illumina genotyping array
【24h】

Down syndrome prediction/screening model based on deep learning and illumina genotyping array

机译:基于深度学习和光照基因分型阵列的唐氏综合症预测/筛选模型

获取原文

摘要

Down syndrome (DS) is a genetic disorder with genome dosage imbalances and micro-duplications of human chromosome 21. It is usually associated with a group of serious diseases, including intellectual disabilities, cardiac diseases, physical abnormalities, and other abnormalities. Currently, since there is no cure for human DS, screening and early detection have become the most efficient way for DS prevention. In this study, we used deep learning techniques to build accurate DS prediction/screening models based on the analysis of newly introduced Illumina genotyping array. Specifically, we built chromosome SNP maps based on clinical genotyping data collected by Vanderbilt University Medical Center. Then we proposed a convolutional neural network (CNN) architecture with ten layers and two merged CNN models, which took two input chromosome SNP maps in combination. Our CNN DS prediction/screening model achieved over 99.3% average accuracy, as well as very low false positive and false negative rate, which are critical to disease prediction and screening in medical practice. It also had better performances in terms of all evaluating metrics when compared with three conventional machine-learning algorithms. Finally, we visualized the feature maps and the trained filter weights from intermediate layers of our trained CNN model. We further discussed the advantages of our method and the underlying reasons for its robust performance.
机译:唐氏综合症(DS)是一种遗传疾病,具有人类染色体21的基因组剂量失衡和微小重复,通常与一组严重疾病有关,包括智力障碍,心脏病,身体异常和其他异常。目前,由于无法治愈人类DS,因此筛查和早期发现已成为预防DS的最有效方法。在这项研究中,我们使用深度学习技术,基于对最新推出的Illumina基因分型阵列的分析,建立了准确的DS预测/筛选模型。具体来说,我们根据范德比尔特大学医学中心收集的临床基因分型数据构建了染色体SNP图谱。然后,我们提出了一个具有十层和两个合并的CNN模型的卷积神经网络(CNN)架构,该架构结合了两个输入染色体SNP图。我们的CNN DS预测/筛查模型的平均准确率超过99.3%,而且假阳性和假阴性率非常低,这对于医学实践中的疾病预测和筛查至关重要。与三种传统的机器学习算法相比,它在所有评估指标方面的性能也更好。最后,我们从经过训练的CNN模型的中间层可视化了特征图和经过训练的过滤器权重。我们进一步讨论了该方法的优点以及其强大性能的根本原因。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号