首页> 外文会议>International Conference on Bioinformatics and Biomedical Engineering >Chaos game representation for discriminating thermophilic from mesophilic protein sequences
【24h】

Chaos game representation for discriminating thermophilic from mesophilic protein sequences

机译:CHAOS游戏表示,用于鉴别嗜合蛋白序列的嗜热嗜热序列

获取原文

摘要

Can sequence analysis tell us about the function of protein? A basic question in protein science is which kind of proteins extent thermostability. Chaos game representation (CGR) can investigate the patterns hiding in protein sequence, visually revealing previously unknown structure. In this paper, we convert every protein sequence into a 20-dimensional vector by CGR algorithm, and based on these vectors we discriminate thermophiles from mesophiles using support vector machine (SVM). The overall accuracy achieves 100% in resubstitution test, and 87.12% in Jackknife test. Moreover, Matthews correlation coefficients (MCC) is 0.745.
机译:序列分析可以告诉我们蛋白质的功能吗?蛋白质科学的基本问题是哪种蛋白质程度的热稳定性。 Chaos游戏表示(CGR)可以调查隐藏在蛋白质序列中的模式,视觉揭示先前未知的结构。在本文中,我们通过CGR算法将每种蛋白质序列转换为20维向量,并基于这些向量,我们使用支持向量机(SVM)区分来自Mesophiles的热电缆。整体准确性在重新提交试验中实现了100%,千克试验中的87.12%。此外,Matthews相关系数(MCC)为0.745。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号