首页> 外文会议>International Conference on Electrical, Computer and Telecommunication Engineering >K-mer Based DNA Methylation Status Prediction Using Support Vector Machine
【24h】

K-mer Based DNA Methylation Status Prediction Using Support Vector Machine

机译:基于K-MER基于支持向量机的DNA甲基化状态预测

获取原文

摘要

Cancer, diabetics, cardiovascular diseases and some of rare diseases are occurred due to the modification in human genome. DNA methylation is one of such type of genomic change. When one or more methyl groups are added to the DNA molecule, it is termed as DNA methylation. It is very difficult and challenging to deal with DNA methylation because of having high dimentionality and noisiness of methylation dataset. In this research work, we have proposed a machine learning based computational model for predicting DNA methylation. Our model is highly involved with bioinformatics and genomic information to extract genomic features. All previous works regarding the prediction of DNA methylation were only based on the statistical tools. Our model has two phases including the feature extraction and selection phase and the classification phase. In feature extraction phase we have used k-mer bioinformatics algorithm to extract the most appropriate features. Then using some statistical tools, we have selected the most discriminating features. Finally, we have chosen SVM, a supervised machine learning algorithm to predict the DNA methylation status. Our model has produced a classification accuracy of 98.65%.
机译:由于人类基因组的修饰,癌症,糖尿病患者,心血管疾病和一些罕见的疾病发生。 DNA甲基化是这样的类型的基因组变化之一。当将一个或多个甲基加入到DNA分子中时,它被称为DNA甲基化。由于具有高的甲基化数据集,对DNA甲基化进行处理是非常困难和挑战性的。在本研究工作中,我们提出了一种基于机器学习的计算模型,用于预测DNA甲基化。我们的模型高度涉及生物信息学和基因组信息,以提取基因组特征。关于预测DNA甲基化的所有工作都仅基于统计工具。我们的模型有两个阶段,包括特征提取和选择阶段和分类阶段。在特征提取阶段,我们使用了K-MER生物信息学算法来提取最合适的功能。然后使用一些统计工具,我们选择了最辨别的功能。最后,我们选择了SVM,监督机器学习算法预测DNA甲基化状态。我们的型号生产了98.65%的分类准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号