...
首页> 外文期刊>Journal of Theoretical Biology >Improving protein fold recognition and structural class prediction accuracies using physicochemical properties of amino acids
【24h】

Improving protein fold recognition and structural class prediction accuracies using physicochemical properties of amino acids

机译:利用氨基酸的理化特性提高蛋白质折叠识别和结构类别预测的准确性

获取原文
获取原文并翻译 | 示例
           

摘要

Predicting the three-dimensional (3-D) structure of a protein is an important task in the field of bioin-formatics and biological sciences. However, directly predicting the 3-D structure from the primary structure is hard to achieve. Therefore, predicting the fold or structural class of a protein sequence is generally used as an intermediate step in determining the protein's 3-D structure. For protein fold recognition (PFR) and structural class prediction (SCP), two steps are required feature extraction step and classification step. Feature extraction techniques generally utilize syntactical-based information, evolutionary-based information and physicochemical-based information to extract features. In this study, we explore the importance of utilizing the physicochemical properties of amino acids for improving PFR and SCP accuracies. For this, we propose a Forward Consecutive Search (FCS) scheme which aims to strategically select physicochemical attributes that will supplement the existing feature extraction techniques for PFR and SCP. An exhaustive search is conducted on all the existing 544 physicochemical attributes using the proposed FCS scheme and a subset of physicochemical attributes is identified. Features extracted from these selected attributes are then combined with existing syntactical-based and evolutionary-based features, to show an improvement in the recognition and prediction performance on benchmark datasets. (C) 2016 Elsevier Ltd. All rights reserved.
机译:预测蛋白质的三维(3-D)结构是生物信息学和生物科学领域的重要任务。但是,难以从一级结构直接预测3-D结构。因此,预测蛋白质序列的折叠或结构类别通常用作确定蛋白质3-D结构的中间步骤。对于蛋白质折叠识别(PFR)和结构分类预测(SCP),需要两个步骤,分别是特征提取步骤和分类步骤。特征提取技术通常利用基于句法的信息,基于进化的信息和基于物理化学的信息来提取特征。在这项研究中,我们探索了利用氨基酸的理化特性改善PFR和SCP准确性的重要性。为此,我们提出了一种前向连续搜索(FCS)方案,该方案旨在从战略上选择物化属性,以补充PFR和SCP的现有特征提取技术。使用提议的FCS方案对所有544个现有物理化学属性进行详尽搜索,并识别出一部分物理化学属性。然后,将从这些选定属性中提取的特征与现有的基于句法和基于进化的特征进行组合,以显示对基准数据集的识别和预测性能的改进。 (C)2016 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号