首页> 美国卫生研究院文献>BMC Bioinformatics >Accurate prediction of protein enzymatic class by N-to-1 Neural Networks
【2h】

Accurate prediction of protein enzymatic class by N-to-1 Neural Networks

机译:N对1神经网络准确预测蛋白质酶类

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We present a novel ab initio predictor of protein enzymatic class. The predictor can classify proteins, solely based on their sequences, into one of six classes extracted from the enzyme commission (EC) classification scheme and is trained on a large, curated database of over 6,000 non-redundant proteins which we have assembled in this work. The predictor is powered by an ensemble of N-to-1 Neural Network, a novel architecture which we have recently developed. N-to-1 Neural Networks operate on the full sequence and not on predefined features. All motifs of a predefined length (31 residues in this work) are considered and are compressed by an N-to-1 Neural Network into a feature vector which is automatically determined during training. We test our predictor in 10-fold cross-validation and obtain state of the art results, with a 96% correct classification and 86% generalized correlation. All six classes are predicted with a specificity of at least 80% and false positive rates never exceeding 7%. We are currently investigating enhanced input encoding schemes which include structural information, and are analyzing trained networks to mine motifs that are most informative for the prediction, hence, likely, functionally relevant.
机译:我们提出了一种新的从头开始的蛋白质酶类预测因子。预测器可以仅根据其序列将蛋白质分类为从酶委员会(EC)分类方案中提取的六种分类之一,并在庞大的精选数据库中进行了训练,该数据库包含6,000多种非冗余蛋白质。该预测器由N对1神经网络集成提供支持,这是我们最近开发的一种新颖架构。 N对1神经网络以完整序列运行,而不是以预定义功能运行。会考虑所有预定义长度的图案(此工作中有​​31个残基),并通过N对1神经网络将其压缩为特征向量,该特征向量会在训练过程中自动确定。我们在10倍交叉验证中测试了我们的预测变量,并获得了最先进的结果,正确分类率为96%,广义相关度为86%。预测所有六个类别的特异性至少为80%,假阳性率永远不会超过7%。我们目前正在研究包括结构信息在内的增强型输入编码方案,并且正在分析经过训练的网络来挖掘对预测最有用的图案,因此可能与功能相关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号