首页> 外文期刊>The Plant Genome >Machine learning analyses of methylation profiles uncovers tissue‐specific gene expression patterns in wheat
【24h】

Machine learning analyses of methylation profiles uncovers tissue‐specific gene expression patterns in wheat

机译:甲基化型材的机器学习分析揭示了小麦组织特异性基因表达模式

获取原文
           

摘要

DNA methylation is a mechanism of epigenetic modification in eukaryotic organisms. Generally, methylation within genes promoter inhibits regulatory protein binding and represses transcription, whereas gene body methylation is associated with actively transcribed genes. However, it remains unclear whether there is interaction between methylation levels across genic regions and which site has the biggest impact on gene regulation. We investigated and used the methylation patterns of the bread wheat cultivar Chinese Spring to uncover differentially expressed genes (DEGs) between roots and leaves, using six machine learning algorithms and a deep neural network. As anticipated, genes with higher expression in leaves were mainly involved in photosynthesis and pigment biosynthesis processes whereas genes that were not differentially expressed between roots and leaves were involved in protein processes and membrane structures. Methylation occurred preponderantly (60%) in the CG context, whereas 35 and 5% of methylation occurred in CHG and CHH contexts, respectively. Methylation levels were highly correlated (r?=?0.7 to 0.9) between all genic regions, except within the promoter (r?=?0.4 to 0.5). Machine learning models gave a high (0.81) prediction accuracy of DEGs. There was a strong correlation (p‐value?=?9.20×10?10) between all features and gene expression, suggesting that methylation across all genic regions contribute to gene regulation. However, the methylation of the promoter, the CDS and the exon in CG context was the most impactful. Our study provides more insights into the interplay between DNA methylation and gene expression and paves the way for identifying tissue‐specific genes using methylation profiles.
机译:DNA甲基化是真核生物中表观遗传改性的机制。通常,基因启动子内的甲基化抑制调节蛋白结合并抑制转录,而基因体甲基化与主动转录的基因有关。然而,它仍然尚不清楚遗传区域甲基化水平之间是否存在相互作用,并且哪种部位对基因调节产生最大的影响。我们研究并使用了面包小麦品种中国弹簧的甲基化模式,以使用六种机器学习算法和深神经网络揭示根部和叶子之间的差异表达基因(DEG)。如预期的那样,叶片表达更高的基因主要参与光合作用和颜料生物合成过程,而在根和叶子之间没有差异表达的基因涉及蛋白质方法和膜结构。在CG上下文中优先发生(60%)发生甲基化,而35和5%的甲基化分别发生在CHG和CHH的情况下。除促进剂中,甲基化水平在所有遗传区域之间的高度相关(R?= 0.7至0.9),除了启动子内(R?= 0.4至0.5)。机器学习模型具有高(0.81)的DEG预测精度。在所有特征和基因表达之间存在强烈的相关性(p值?=Δ= 9.20×10?10),表明所有遗传区域的甲基化有助于基因调控。然而,促进剂的甲基化,CG中的CD和外显子是最有影响的。我们的研究提供了更多的见解,进入DNA甲基化和基因表达之间的相互作用,并铺平使用甲基化型材鉴定组织特异性基因的方式。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号