Redundancy-weighting the PDB for detailed secondary structure prediction using deep-learning models

Abstract

Motivation: The Protein Data Bank (PDB), the ultimate source for data in structural biology, is inherently imbalanced. To alleviate biases, virtually all structural biology studies use nonredundant (NR) subsets of the PDB, which include only a fraction of the available data. An alternative approach, dubbed redundancy-weighting (RW), down-weights redundant entries rather than discarding them. This approach may be particularly helpful for machine-learning (ML) methods that use the PDB as their source for data. Methods for secondary structure prediction (SSP) have greatly improved over the years with recent studies achieving above 70% accuracy for eight-class (DSSP) prediction. As these methods typically incorporate ML techniques, training on RW datasets might improve accuracy, as well as pave the way toward larger and more informative secondary structure classes.
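The idea of down-weighting redundant entries rather than discarding them can be illustrated with a minimal sketch. The abstract does not state how the weights are computed, so the scheme below is an assumption made purely for illustration: each entry receives a weight of 1 divided by the size of its similarity cluster, so that every cluster contributes the same total weight. The function names `redundancy_weights` and `weighted_accuracy` and the cluster labels are hypothetical and are not taken from the paper.

```python
from collections import Counter


def redundancy_weights(cluster_ids):
    """Return one weight per entry: 1 / (size of the entry's cluster).

    Assumption for illustration only: entries are already grouped into
    similarity clusters, and each cluster should carry a total weight of 1.
    """
    sizes = Counter(cluster_ids)
    return [1.0 / sizes[c] for c in cluster_ids]


def weighted_accuracy(correct, weights):
    """Fraction of the total weight that falls on correctly predicted entries."""
    total = sum(weights)
    return sum(w for ok, w in zip(correct, weights) if ok) / total


# Hypothetical toy data: three near-identical entries in cluster "A",
# one unique entry in cluster "B".
clusters = ["A", "A", "A", "B"]
weights = redundancy_weights(clusters)

# The three "A" entries are predicted correctly, the "B" entry is not.
correct = [True, True, True, False]
unweighted = sum(correct) / len(correct)
weighted = weighted_accuracy(correct, weights)
```

In this toy case the unweighted score is dominated by the over-represented cluster, while the weighted score treats the two clusters equally. A nonredundant subset would instead keep a single representative per cluster and drop the remaining entries; the same per-entry weights could in principle be passed to a training loss as sample weights.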

Bibliographic record

Source: Bioinformatics, 2020, Issue 12, 6 pages
Original format: PDF
Language: English
Chinese Library Classification: Bioengineering (biotechnology)

