首页> 美国卫生研究院文献>Wiley-Blackwell Online Open >The path of no return—Truncated protein N‐termini and current ignorance of their genesis
【2h】

The path of no return—Truncated protein N‐termini and current ignorance of their genesis

机译:无回报的途径—截短的蛋白N末端及其目前对其起源的无知

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Almost all regulatory processes in biology ultimately lead to or originate from modifications of protein function. However, it is unclear to which extent each mechanism of regulation actually affects proteins and thus phenotypes. We assessed the extent of N‐terminal protein truncation in a global analysis of N‐terminomics data and find that most proteins have N‐terminally truncated proteoforms. Because N‐terminomics analyses do not identify the process generating the identified N‐termini, we compared identified termini to the three N‐termini generating events: protein cleavage, alternative translation, and alternative splicing. Of these, we sought to identify the most likely cause of N‐terminal protein truncations in the human proteome. We found that protease cleavage and alternative protein translation are the likely cause for most shortened proteoforms. However, the vast majority (about 90%) of N‐termini remain unexplained by any of these processes identified to date, so revealing large gaps in our knowledge of protein termini and their genesis. Further analysis and annotation of terminomics data is required, to which end we have created the TopFIND database, a major systematic annotation effort for protein termini. We outline the new features in version 3.0 of the updated database and the new bioinformatics tools available and encourage submission of generated data to fill current knowledge gaps.
机译:生物学中几乎所有的调节过程最终都会导致或起源于蛋白质功能的改变。然而,目前尚不清楚每种调节机制在多大程度上真正影响蛋白质,进而影响表型。我们在对N-术语组学数据的全局分析中评估了N-末端蛋白截短的程度,发现大多数蛋白质具有N-末端截短的蛋白形式。由于N术语组学分析无法确定生成所确定的N端的过程,因此我们将鉴定的末端与三个N端发生事件进行了比较:蛋白质切割,替代翻译和替代剪接。其中,我们试图确定人类蛋白质组中N末端蛋白截短的最可能原因。我们发现蛋白酶切割和替代蛋白质翻译是大多数缩短的蛋白形式的可能原因。然而,迄今为止,这些过程中的任何一个都无法解释N-末端的绝大多数(约90%),因此揭示了我们对蛋白质末端及其起源的认识上的巨大差距。需要对术语学数据进行进一步的分析和注释,为此,我们创建了TopFIND数据库,这是蛋白质末端的主要系统注释工作。我们概述了更新数据库的3.0版中的新功能以及可用的新生物信息学工具,并鼓励提交生成的数据以填补当前的知识空白。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号