首页> 外文学位 >Predicting DNA methylation state of CPG dinucleotide using genome topological features and deep networks.
【24h】

Predicting DNA methylation state of CPG dinucleotide using genome topological features and deep networks.

机译:使用基因组拓扑特征和深层网络预测CPG二核苷酸的DNA甲基化状态。

获取原文
获取原文并翻译 | 示例

摘要

The hypo- or hyper-methylation of the human genome is one of the epigenetic features of leukemia. However, experimental approaches have only determined the methylation state of a small portion of the human genome. I developed a deep learning based (stacked denoising autoencoders, or SdA) software named "DeepMethyl" to predict the methylation state of DNA CpG dinucleotides using features inferred from three-dimensional genome topology (based on Hi-C) and DNA sequence patterns. I used the experimental data from immortalized myelogenous leukemia (K562) and healthy lymphoblastoid (GM12878) cell lines to train the learning models and assess prediction performance. I have tested various SdA architectures with different configurations of hidden layer(s) and amount of pre-training data and compared the performance of deep networks relative to support vector machines (SVM). Using the methylation states of sequentially neighboring regions as one of the learning features, SdA achieved a blind test accuracy of 89.7% for GM12878 and 88.6% for K562. When the methylation states of sequentially neighboring regions are unknown, the accuracies are 84.82% for GM12878 and 72.01% for K562. I also analyzed the contribution of genome topological features inferred from Hi-C. DeepMethyl can be accessed at http://dna.cs.usm.edu/deepmethyl/.
机译:人类基因组的甲基化过低或过高是白血病的表观遗传特征之一。然而,实验方法仅确定了人类基因组的一小部分的甲基化状态。我开发了一种基于深度学习的软件(称为“ DeepMethyl”的叠加式去噪自动编码器,即SdA),可使用从三维基因组拓扑(基于Hi-C)和DNA序列模式推断出的特征来预测DNA CpG二核苷酸的甲基化状态。我使用了永生的骨髓性白血病(K562)和健康的成淋巴细胞样(GM12878)细胞系的实验数据来训练学习模型并评估预测性能。我测试了具有不同隐藏层配置和预训练数据量的各种SdA架构,并比较了深层网络相对于支持向量机(SVM)的性能。使用顺序相邻区域的甲基化状态作为学习特征之一,SdA的盲测试准确度对GM12878为89.7%,对K562为88.6%。当顺序相邻区域的甲基化状态未知时,GM12878的准确度为84.82%,K562的准确度为72.01%。我还分析了从Hi-C推断出的基因组拓扑特征的贡献。可以从http://dna.cs.usm.edu/deepmethyl/访问DeepMethyl。

著录项

  • 作者

    Wang, Yiheng.;

  • 作者单位

    The University of Southern Mississippi.;

  • 授予单位 The University of Southern Mississippi.;
  • 学科 Bioinformatics.;Statistics.;Computer science.
  • 学位 M.S.
  • 年度 2016
  • 页码 63 p.
  • 总页数 63
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号