Conference: WSEAS International Conferences

Prediction of Methylation Status on DNA sequences and Identification of Its Important DNA Sequence Features


Abstract

In mammalian genomes, the cytosines of most CpG dinucleotides, except those in gene promoters, are subject to modification by a methyl group (methylation). A number of mammalian genes are regulated by methylation in a development-specific or tissue-specific manner. Mammalian DNA methylation contributes to the regulation of gene expression, repression of parasitic sequences, X-chromosome inactivation in females, genomic imprinting, and other processes. Aberrant methylation leads to cancer and some genetic diseases in humans. It is therefore necessary to determine the methylation status of the human genome comprehensively in each cell type. However, because comprehensive methylation analyses require a great deal of time and labor, methylation status has been determined for only part of the genome in mammals. For this reason, it is important to train machine learning models on known methylation data and to predict the methylation status of other genomic regions. Moreover, because the sequence differences between unmethylated and methylated DNA regions remain unclear, those differences should also be determined. We therefore trained a support vector machine on our previously reported methylation data and predicted the methylation status of DNA sequences from their sequence features. Furthermore, we used random forest to explore the sequence features that differ between unmethylated and methylated DNA sequences.
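The abstract does not list which DNA sequence features were used, so the following is only a minimal sketch of what such a feature vector might look like. The function name `sequence_features` and the chosen features (GC content, CpG observed/expected ratio, and dinucleotide frequencies) are assumptions for illustration, not the authors' feature set. A vector like this could then be passed to a classifier such as a support vector machine or a random forest.

```python
from collections import Counter
from itertools import product


def sequence_features(seq: str) -> dict:
    """Return illustrative composition features for one DNA sequence.

    Hypothetical helper: the feature set is an assumption, not the
    one used in the paper.
    """
    seq = seq.upper()
    n = len(seq)
    if n < 2:
        raise ValueError("sequence must contain at least two bases")

    base_counts = Counter(seq)
    # Count overlapping dinucleotides at positions (0,1), (1,2), ...
    dinuc_counts = Counter(seq[i:i + 2] for i in range(n - 1))

    c, g = base_counts["C"], base_counts["G"]
    features = {"gc_content": (c + g) / n}

    # CpG observed/expected ratio: count(CG) * n / (count(C) * count(G)).
    # Defined as 0.0 here when the sequence has no C or no G.
    features["cpg_obs_exp"] = dinuc_counts["CG"] * n / (c * g) if c and g else 0.0

    # Relative frequency of each of the 16 possible dinucleotides.
    for a, b in product("ACGT", repeat=2):
        features[f"freq_{a}{b}"] = dinuc_counts[a + b] / (n - 1)
    return features
```

Every sequence yields the same 18 keys, so the values can be stacked row by row into a fixed-width matrix for training.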
