首页> 外文会议>IEEE Region 10 Symposium >LyFor:Prediction of lysine formylation sites from sequence based features using support vector machine
【24h】

LyFor:Prediction of lysine formylation sites from sequence based features using support vector machine

机译:LyFor:使用支持向量机从基于序列的特征预测赖氨酸甲酰化位点

获取原文

摘要

Lysine formylation is a recently invented post- translational modification (PTM), which mostly resides on nuclear histone proteins. It is mainly responsible for playing an effective role in the mechanisms of cellular chromatin regulation such as DNA binding, DNA repair and protein synthesis and has great effect on other PTMs such as methylation and acetylation. As computational methods are simple, popular and high speedy compared to traditional experimental methods, it is very important and essential to generate mathematical model for proper identification of formylated lysine sites. A useful bioinformatics tool named LyFor, in this study, is developed by using amino acid composition (AAC), amino acid index (AAI), binary encoding (BE) and composition of k-spaced amino acid pair (CKSAAP) feature construction techniques to predict formylated lysine residues and non-formylated lysine residues. Moreover, a dimensional reduction method named principal component analysis (PCA) and randomly oversample method were used for preprocessing training dataset, which was applied to train the model with support vector machine algorithm. We have seen that LyFor achieves a better performance with an accuracy of 90.02 % for 10-fold cross-validation compared to existing models. Therefore, the analysis and prediction of lysine formylation may provide very useful information to study the mechanisms of chromatin regulation.
机译:赖氨酸甲酰化是最近发明的翻译后修饰(PTM),主要存在于核组蛋白上。它主要负责在细胞染色质调节机制(例如DNA结合,DNA修复和蛋白质合成)中发挥有效作用,并对其他PTM(例如甲基化和乙酰化)产生重大影响。与传统的实验方法相比,由于计算方法简单,普及且速度快,因此生成数学模型以正确识别甲酰化赖氨酸位点非常重要且必不可少。通过使用氨基酸组成(AAC),氨基酸索引(AAI),二进制编码(BE)和k间隔氨基酸对(CKSAAP)组成特征构建技术,开发了一种有用的名为LyFor的生物信息学工具。预测甲酰化的赖氨酸残基和非甲酰化的赖氨酸残基。此外,对预处理训练数据集使用了一种称为主成分分析(PCA)的降维方法和随机过采样方法,该方法用于支持向量机算法对模型的训练。我们已经看到,与现有模型相比,LyFor可以实现更好的性能,十倍交叉验证的准确度为90.02%。因此,赖氨酸甲酰化的分析和预测可能为研究染色质调控机制提供非常有用的信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号