N-GlyDE: a two-stage N-linked glycosylation site prediction incorporating gapped dipeptides and pattern-based encoding

Thejkiran Pitti; Ching-Tai Chen; Hsin-Nan Lin; Wai-Kok Choong; Wen-Lian Hsu; Ting-Yi Sung

首页> 外文期刊>Scientific reports. >N-GlyDE: a two-stage N-linked glycosylation site prediction incorporating gapped dipeptides and pattern-based encoding

【24h】

N-GlyDE: a two-stage N-linked glycosylation site prediction incorporating gapped dipeptides and pattern-based encoding

机译：n-glyde：一种掺入撕开的二肽和基于图案的编码的两阶段n键合糖基化位点预测

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

N-linked glycosylation is one of the predominant post-translational modifications involved in a number of biological functions. Since experimental characterization of glycosites is challenging, glycosite prediction is crucial. Several predictors have been made available and report high performance. Most of them evaluate their performance at every asparagine in protein sequences, not confined to asparagine in the N-X-S/T sequon. In this paper, we present N-GlyDE, a two-stage prediction tool trained on rigorously-constructed non-redundant datasets to predict N-linked glycosites in the human proteome. The first stage uses a protein similarity voting algorithm trained on both glycoproteins and non-glycoproteins to predict a score for a protein to improve glycosite prediction. The second stage uses a support vector machine to predict N-linked glycosites by utilizing features of gapped dipeptides, pattern-based predicted surface accessibility, and predicted secondary structure. N-GlyDE's final predictions are derived from a weight adjustment of the second-stage prediction results based on the first-stage prediction score. Evaluated on N-X-S/T sequons of an independent dataset comprised of 53 glycoproteins and 33 non-glycoproteins, N-GlyDE achieves an accuracy and MCC of 0.740 and 0.499, respectively, outperforming the compared tools. The N-GlyDE web server is available at http://bioapp.iis.sinica.edu.tw/N-GlyDE/ .

机译：N-连接的糖基化是涉及许多生物学功能的主要翻译后修改之一。由于血糖上的实验表征是挑战性的，因此糖化预测至关重要。已经提供了几种预测因子并报告了高性能。其中大多数评估它们在蛋白质序列中的每一种天冬酰胺的性能，而不是在N-X-S / T序列中限制到天冬酰胺。在本文中，我们呈现N-Glyde，在严格构造的非冗余数据集上训练的两级预测工具，以预测人蛋白质组中的N键合血糖技术。第一阶段使用培训糖蛋白和非糖蛋白的蛋白质相似性投票算法，以预测蛋白质以改善糖化预测的分数。第二阶段使用支持向量机通过利用覆盖二肽，基于图案的预测表面可访问性和预测的二级结构的特征来预测N链综合材料。基于第一阶段预测得分的第二阶段预测结果的权重调整，N-Glyde的最终预测结果来自于第一阶段预测得分。在由53个糖蛋白和33个非糖蛋白组成的独立数据集的N-X-S / T序列中评价，N-Glyde分别达到0.740和0.499的精度和MCC，优于比较的工具。 N-Glyde Web服务器可在http://bioapp.iis.sinica.edu.tw/n-glyde/上获取。

著录项

来源
《Scientific reports.》 |2019年第1期|共页
作者
Thejkiran Pitti; Ching-Tai Chen; Hsin-Nan Lin; Wai-Kok Choong; Wen-Lian Hsu; Ting-Yi Sung;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Computational prediction of N-linked glycosylation incorporating structural properties and patterns [J] . Chuang Gwo-Yu, Boyington Jeffrey C., Joyce M. Gordon, Bioinformatics . 2012,第17期

机译：结合结构特性和模式的N-连接糖基化的计算预测
2. Computational prediction of N-linked glycosylation incorporating structural properties and patterns [J] . Gwo-Yu Chuang Jeffrey C. Boyington M. Gordon Joyce Jiang Zhu Gary J. Nabel Peter D. Kwong and Ivelin Georgiev* Bioinformatics . 2012,第17期

机译：结合结构特性和模式的N-连接糖基化的计算预测
3. Structure-based Comparative Analysis and Prediction of N-linked Glycosylation Sites in Evolutionarily Distant Eukaryotes * [J] . Phuc Vinh Nguyen Lam, Radoslav Goldman, Konstantinos Karagiannis, Genomics, proteomics & bioinformatics . 2013,第2期

机译：基于结构的比较分析和预测远距离真核生物中N-联糖基化位点*
4. NEURAL NETWORK-BASED PREDICTION OF VARIABLE SITE-OCCUPANCY OF N-LINKED GLYCOSYLATION [C] . Ryan S. Senger, M. Nazmul Karim International Federation of Agricultural Producers International Symposium . 2005

机译：基于神经网络的N-连接糖基化的可变部位占用的预测
5. Modeling of recombinant enzyme inactivation and prediction of N-linked glycosylation site-occupancy and microheterogeneity. [D] . Senger, Ryan S. 2005

机译：重组酶灭活的建模和N-联糖基化位点的占有率和微异质性的预测。
6. Computational prediction of N-linked glycosylation incorporating structural properties and patterns [O] . Gwo-Yu Chuang, Jeffrey C. Boyington, M. Gordon Joyce, -1

机译：结合结构特性和模式的N-连接糖基化的计算预测
7. Prediction of N-linked glycosylation sites using position relative features and statistical moments. [O] . Muhammad Aizaz Akmal, Nouman Rasool, Yaser Daanial Khan 2017

机译：使用位置相对特征和统计矩预测N-连接的糖基化位点。

N-GlyDE: a two-stage N-linked glycosylation site prediction incorporating gapped dipeptides and pattern-based encoding

摘要

著录项

相似文献

相关主题

期刊订阅