...
首页> 外文期刊>Gene: An International Journal Focusing on Gene Cloning and Gene Structure and Function >Gene expression classification using epigenetic features and DNA sequence composition in the human embryonic stem cell line H1
【24h】

Gene expression classification using epigenetic features and DNA sequence composition in the human embryonic stem cell line H1

机译:利用人类胚胎干细胞系H1的表观遗传特征和DNA序列组成进行基因表达分类

获取原文
获取原文并翻译 | 示例
           

摘要

Epigenetic factors are known to correlate with gene expression in the existing studies. However, quantitative models that accurately classify the highly and lowly expressed genes based on epigenetic factors are currently lacking. In this study, a new machine learning method combines histone modifications, DNA methylation, DNA accessibility, transcription factors, and trinucleotide composition with support vector machines (SVM) is developed in the context of human embryonic stem cell line (H1). The results indicate that the predictive accuracy will be markedly improved when the epigenetic features are considered. The predictive accuracy and Matthews correlation coefficient of the best model are as high as 95.96% and 0.92 for 10-fold cross-validation test, and 95.58% and 0.92 for independent dataset test, respectively. Our model provides a good way to judge a gene is either highly or lowly expressed gene by using genetic and epigenetic data, when the expression data of the gene is lacking. And a web-server GECES for our analysis method is established at http://202.207.14.87:8032/fuwu/GECES/index.asp, so that other scientists can easily get their desired results by our web-server, without going through the mathematical details. (C) 2016 Elsevier B.V. All rights reserved.
机译:在现有研究中,表观遗传因素与基因表达相关。但是,目前缺乏基于表观遗传因素对高表达和低表达基因进行准确分类的定量模型。在这项研究中,一种新的机器学习方法结合了组蛋白修饰,DNA甲基化,DNA可及性,转录因子和三核苷酸组成与支持向量机(SVM),在人类胚胎干细胞系(H1)的背景下得以开发。结果表明,考虑表观遗传特征后,预测准确性将得到显着提高。最佳模型的预测准确性和Matthews相关系数对于10倍交叉验证测试分别高达95.96%和0.92,对于独立数据集测试则分别高达95.58%和0.92。当缺乏基因表达数据时,我们的模型提供了一种通过使用遗传和表观遗传数据判断基因是高表达还是低表达的好方法。在http://202.207.14.87:8032/fuwu/GECES/index.asp上建立了用于我们的分析方法的网络服务器GECES,以便其他科学家可以轻松地通过我们的网络服务器获得所需的结果,而无需经历数学细节。 (C)2016 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号