首页> 外文会议>International Conference on Artificial Intelligence >Intelligent analysis of methylation data in Head and Neck Squamous Cell Carcinoma (HNSCC) interactomes
【24h】

Intelligent analysis of methylation data in Head and Neck Squamous Cell Carcinoma (HNSCC) interactomes

机译:头颈鳞状细胞癌(HNSCC)椎间囊组致甲基化数据智能分析

获取原文

摘要

Head and neck squamous cell carcinoma (HNSCC) is a heterogeneous disease which arises due to various genetic, epigenetic and environmental factors. DNA methylation is an epigenetic factor that is found to have a role in the development and progression of HNSCC through genetic and epigenetic silencing. Analysis of the methylation data can facilitate us to explore variations in several gene sites and can narrow down our search for curing HNSCC. The aim of this study was to explore and analyze the DNA Methylation data of HNSCC and make intelligent machine learning (ML) models that can predict the expression levels of a particular gene site based on various features. Difference between the gene expression levels of normal and tumor samples obtained from TCGA was calculated and then the genes were classified into hypo-methylated, hyper-methylated and non-methylated, respectively. Moreover, network analysis and functional enrichment analysis was performed to identify the protein-protein interaction (PPI) and involvement in the biological process followed by training logistic regression, support vector machine (SVM) and k-nearest neighbors (KNN) models for prediction. Logistic regression was found to have the highest accuracy of 65% among all the ML models. Furthermore, MYC, POLR2A, ALB, MTOR, H2AFX, SMARCA4, PAX6, GATA3 and MDM2 were identified as the hub genes in the HNSCC network. Whereas, hyper-methylated, hypo-methylated and non-methylated genes were found to be enriched in neuroactive ligand-receptor interaction, neurogenesis, ion transport channels, cell cycle and plasma membrane. In future, more data and features are required for validation and improving the accuracy of the ML models.
机译:头部和颈部鳞状细胞癌(HNSCC)是一种异质疾病,由于各种遗传,表观遗传和环境因素而产生。 DNA甲基化是一种表观遗传因素,发现通过遗传和表观遗传沉默在HNSCC的发育和进展中具有作用。对甲基化数据的分析可以促进我们探讨几个基因网站的变化,并且可以缩小我们对HNSCC治愈的搜索。本研究的目的是探索和分析HNSCC的DNA甲基化数据,并使智能机器学习(ML)模型基于各种特征预测特定基因位点的表达水平。计算了从TCGA获得的正常和肿瘤样品的基因表达水平之间的差异,然后将基因分别分为低甲基化,超甲基化和非甲基化。此外,进行网络分析和功能性富集分析以鉴定蛋白质 - 蛋白质相互作用(PPI)并参与生物过程,然后参与训练物流回归,支持向量机(SVM)和K最近邻居(KNN)模型进行预测。发现Logistic回归在所有ML模型中具有65%的最高精度。此外,MYC,POLR2A,ALB,MTOR,H2AFX,SMARCA4,PAX6,GATA3和MDM2被确定为HNSCC网络中的集线器基因。然而,发现超甲基化的,甲基化和非甲基化基因富含神经活性配体 - 受体相互作用,神经发生,离子输送通道,细胞周期和质膜。在未来,更多的数据和功能都需要验证和改进ML模型的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号