首页> 美国卫生研究院文献>Molecules >Systematic Identification of Machine-Learning Models Aimed to Classify Critical Residues for Protein Function from Protein Structure
【2h】

Systematic Identification of Machine-Learning Models Aimed to Classify Critical Residues for Protein Function from Protein Structure

机译:旨在基于蛋白质结构对蛋白质功能的关键残基进行分类的机器学习模型的系统识别

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Protein structure and protein function should be related, yet the nature of this relationship remains unsolved. Mapping the critical residues for protein function with protein structure features represents an opportunity to explore this relationship, yet two important limitations have precluded a proper analysis of the structure-function relationship of proteins: (i) the lack of a formal definition of what critical residues are and (ii) the lack of a systematic evaluation of methods and protein structure features. To address this problem, here we introduce an index to quantify the protein-function criticality of a residue based on experimental data and a strategy aimed to optimize both, descriptors of protein structure (physicochemical and centrality descriptors) and machine learning algorithms, to minimize the error in the classification of critical residues. We observed that both physicochemical and centrality descriptors of residues effectively relate protein structure and protein function, and that physicochemical descriptors better describe critical residues. We also show that critical residues are better classified when residue criticality is considered as a binary attribute (i.e., residues are considered critical or not critical). Using this binary annotation for critical residues 8 models rendered accurate and non-overlapping classification of critical residues, confirming the multi-factorial character of the structure-function relationship of proteins.
机译:蛋白质结构和蛋白质功能应该相关,但是这种关系的性质尚未解决。绘制具有蛋白质结构特征的蛋白质功能关键残基代表了探索这种关系的机会,但是两个重要的局限性阻碍了对蛋白质结构-功能关系的正确分析:(i)缺乏对关键残基的正式定义和(ii)缺乏对方法和蛋白质结构特征的系统评价。为了解决这个问题,我们在此引入了一种基于实验数据的指标来量化残基的蛋白质功能关键性,以及旨在同时优化蛋白质结构的描述符(理化和中心性描述符)和机器学习算法的策略,以最大程度地减少残基。关键残基的分类错误。我们观察到,残基的物理化学和中心性描述符都有效地与蛋白质结构和蛋白质功能相关,并且理化描述符更好地描述了关键残基。我们还显示,当将残基关键性视为二元属性时(即残基被视为关键或不关键),关键残基可得到更好的分类。使用此关键残基的二进制注释,8个模型对关键残基进行了准确且不重叠的分类,从而确认了蛋白质结构-功能关系的多因素特征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号