...
首页> 外文期刊>Nucleic acids research >PartsList: a web-based system for dynamically ranking protein folds based on disparate attributes, including whole-genome expression and interaction information
【24h】

PartsList: a web-based system for dynamically ranking protein folds based on disparate attributes, including whole-genome expression and interaction information

机译:PartsList:一个基于Web的系统,用于基于完全不同的属性(包括全基因组表达和相互作用信息)对蛋白质折叠进行动态排名

获取原文
   

获取外文期刊封面封底 >>

       

摘要

As the number of protein folds is quite limited, a mode of analysis that will be increasingly common in the future, especially with the advent of structural genomics, is to survey and re-survey the finite parts list of folds from an expanding number of perspectives. We have developed a new resource, called PartsList, that lets one dynamically perform these comparative fold surveys. It is available on the web at http://bioinfo.mbb.yale.edu/partslist and http://www.partslist.org. The system is based on the existing fold classifications and functions as a form of companion annotation for them, providing ‘global views' of many already completed fold surveys. The central idea in the system is that of comparison through ranking; PartsList will rank the approximately 420 folds based on more than 180 attributes. These include: (i) occurrence in a number of completely sequenced genomes (e.g.?it will show the most common folds in the worm versus yeast); (ii) occurrence in the structure databank (e.g. most common folds in the PDB); (iii) both absolute and relative gene expression information (e.g. most changing folds in expression over the cell cycle); (iv) protein–protein interactions, based on experimental data in yeast and comprehensive PDB surveys (e.g. most interacting fold); (v) sensitivity to inserted transposons; (vi) the number of functions associated with the fold (e.g. most multi-functional folds); (vii) amino acid composition (e.g. most Cys-rich folds); (viii) protein motions (e.g. most mobile folds); and (ix) the level of similarity based on a comprehensive set of structural alignments (e.g.?most structurally variable folds). The integration of whole-genome expression and protein–protein interaction data with structural information is a particularly novel feature of our system. We provide three ways of visualizing the rankings: a profiler emphasizing the progression of high and low ranks across many pre-selected attributes, a dynamic comparer for custom comparisons and a numerical rankings correlator. These allow one to directly compare very different attributes of a fold (e.g.?expression level, genome occurrence and maximum motion) in the uniform numerical format of ranks. This uniform framework, in turn, highlights the way that the frequency of many of the attributes falls off with approximate power-law behavior (i.e.?according to V–b, for attribute value V and constant exponent b), with a few folds having large values and most having small values.
机译:由于蛋白质折叠的数量非常有限,因此在未来,尤其是随着结构基因组学的出现,一种分析模式将越来越普遍,它是从越来越多的角度来调查和重新研究折叠的有限部分清单。我们已经开发了一种名为PartsList的新资源,该资源可以动态执行这些比较折叠调查。可以在以下网站上找到它:http://bioinfo.mbb.yale.edu/partslist和http://www.partslist.org。该系统基于现有的折页分类,并作为它们的伴随注释形式,为许多已经完成的折页调查提供“全局视图”。系统的中心思想是通过排名进行比较; PartsList将基于180多个属性对大约420折进行排名。这些包括:(i)存在于许多完全测序的基因组中(例如,它将显示出蠕虫相对于酵母最常见的折叠); (ii)出现在结构数据库中(例如PDB中最常见的折叠); (iii)绝对和相对基因表达信息(例如,整个细胞周期中大多数表达变化的折叠); (iv)蛋白质-蛋白质相互作用,基于酵母的实验数据和全面的PDB调查(例如,大多数相互作用的折叠); (v)对插入的转座子的敏感性; (vi)与折叠相关的功能数量(例如,大多数多功能折叠); (vii)氨基酸组成(例如,大多数富含Cys的折叠); (viii)蛋白质运动(例如大多数活动折叠); (ix)基于一组全面的结构比对(例如,大多数结构可变的折叠)的相似度。全基因组表达和蛋白质-蛋白质相互作用数据与结构信息的集成是我们系统的一个特别新颖的功能。我们提供了三种可视化排名的方法:一个通过许多预选属性来强调高低排名进展的探查器,一个用于自定义比较的动态比较器以及一个数字排名相关器。这些使人们能够以等级的统一数值格式直接比较折叠的非常不同的属性(例如表达水平,基因组发生和最大运动)。反过来,这种统一的框架突出了许多属性的频率随近似幂律行为而下降的方式(即,对于属性值V和常数指数b,根据V –b ) ),其中有几折具有较大的价值,而多数则具有较小的价值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号