DEEPred: Automated Protein Function Prediction with Multi-task Feed-forward Deep Neural Networks

Ahmet Sureyya Rifaioglu; Tunca Doğan; Maria Jesus Martin; Rengul Cetin-Atalay; Volkan Atalay


Abstract

Automated protein function prediction is critical for the annotation of uncharacterized protein sequences, where accurate prediction methods are still required. Recently, deep learning based methods have outperformed conventional algorithms in computer vision and natural language processing due to the prevention of overfitting and efficient training. Here, we propose DEEPred, a hierarchical stack of multi-task feed-forward deep neural networks, as a solution to Gene Ontology (GO) based protein function prediction. DEEPred was optimized through rigorous hyper-parameter tests, and benchmarked using three types of protein descriptors, training datasets of varying sizes and GO terms from different levels. Furthermore, in order to explore how training with larger but potentially noisy data would change the performance, electronically made GO annotations were also included in the training process. The overall predictive performance of DEEPred was assessed using the CAFA2 and CAFA3 challenge datasets, in comparison with state-of-the-art protein function prediction methods. Finally, we evaluated selected novel annotations produced by DEEPred with a literature-based case study considering the 'biofilm formation process' in Pseudomonas aeruginosa. This study reports that deep learning algorithms have significant potential in protein function prediction, particularly when the source data is large. The neural network architecture of DEEPred can also be applied to the prediction of other types of ontological associations. The source code and all datasets used in this study are available at: https://github.com/cansyl/DEEPred
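
To make the phrase "hierarchical stack of multi-task feed-forward deep neural networks" more concrete, the following is a minimal, untrained sketch in plain Python. It is not the DEEPred implementation: the layer sizes, the random initialisation, the GO term labels (GO_TERM_A and so on), and the names MultiTaskFeedForward and predict_all are all assumptions made for illustration. It only shows the general idea that each model shares its hidden layer across several GO terms and emits one score per term, and that several such models can be queried together.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def relu(x):
    return max(0.0, x)


def dense(inputs, weights, biases, activation):
    """One fully connected layer: activation(W . inputs + b), row by row."""
    return [
        activation(sum(w * x for w, x in zip(row, inputs)) + b)
        for row, b in zip(weights, biases)
    ]


def init_layer(n_in, n_out, rng):
    """Small random weights and zero biases (illustrative, not tuned)."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


class MultiTaskFeedForward:
    """Shared hidden layer followed by one sigmoid output per GO term (task)."""

    def __init__(self, n_features, n_hidden, go_terms, seed=0):
        rng = random.Random(seed)
        self.go_terms = list(go_terms)
        self.hidden = init_layer(n_features, n_hidden, rng)
        self.output = init_layer(n_hidden, len(self.go_terms), rng)

    def predict(self, features):
        # The hidden representation is shared by every task of this model.
        h = dense(features, self.hidden[0], self.hidden[1], relu)
        scores = dense(h, self.output[0], self.output[1], sigmoid)
        return dict(zip(self.go_terms, scores))


def predict_all(stack, features):
    """Query every model in the stack and merge the per-term scores."""
    merged = {}
    for model in stack:
        merged.update(model.predict(features))
    return merged


# Hypothetical stack: one multi-task model per group of GO terms.
stack = [
    MultiTaskFeedForward(8, 4, ["GO_TERM_A", "GO_TERM_B"], seed=1),
    MultiTaskFeedForward(8, 4, ["GO_TERM_C"], seed=2),
]

# Hypothetical 8-dimensional protein descriptor vector.
protein_descriptor = [0.2, 0.0, 0.7, 0.1, 0.5, 0.3, 0.9, 0.4]
predictions = predict_all(stack, protein_descriptor)
```

Because the weights are random and no training takes place, the scores carry no biological meaning. In a real system each model would be trained on annotated proteins, and a score threshold would decide which GO terms are assigned.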

Bibliographic details

Source: Scientific Reports, 2019, Issue 1, 16 pages
Authors: Ahmet Sureyya Rifaioglu; Tunca Doğan; Maria Jesus Martin; Rengul Cetin-Atalay; Volkan Atalay
Original format: PDF
