首页> 外文学位 >Protein functionality analysis through protein-protein interaction networks.
【24h】

Protein functionality analysis through protein-protein interaction networks.

机译:通过蛋白质-蛋白质相互作用网络进行蛋白质功能分析。

获取原文
获取原文并翻译 | 示例

摘要

Since the sequencing of the human genome was brought to fruition [142, 66], the field of genetics now stands on the threshold of significant theoretical and practical advances. Crucial to furthering these investigations is a comprehensive understanding of the expression, function, and regulation of the proteins encoded by an organism [155]. This understanding is the subject of the discipline of proteomics. Proteomics encompasses a wide range of approaches and applications intended to explicate how complex biological processes occur at a molecular level, how they differ in various cell types, and how they are altered in disease states.;In particular, the elucidation of protein function has been, and remains, one of the most central problems in computational biology. Proteins are macromolecules that serve as building blocks and functional components of a cell, and account for the second largest fraction of the cellular weight after water. Proteins are responsible for some of the most important functions in an organism, such as constitution of the organs (structural proteins), the catalysis of biochemical reactions necessary for metabolism (enzymes), and the maintenance of the cellular environment (transmembrane proteins). Thus, proteins are the most essential and versatile macromolecules of life, and the knowledge of their functions is a crucial link in the development of new drugs, better crops, and even the development of synthetic biochemicals such as biofuels [49].;A recent review noted that a large fraction of currently sequenced complete genomes has at least half of their gene entries having ambiguous annotations [59]. The classical way to predict protein functions is to find homologies between an unannotated protein and other proteins using sequence similarity algorithms, such as FASTA [106] and PSI-BLAST [85]. The function of the unannotated protein can then be assigned according to the annotated proteins with similar sequences. In addition, several computational approaches are proposed based on correlated evolution mechanisms of genes. For example, the domain fusion analysis infers that a pair of proteins interacts with each other and thus performs related functions [88].;In recent years, the high-throughput bio-techniques have provided additional opportunities for inference of protein functions. Protein-protein interaction (PPI) data, enriched by high-throughput experiments including yeast two-hybrid analysis [67][139], mass spectrometry [50][62] and synthetic lethality screen [3], have provided the important clues of functional associations between proteins. It has been observed that proteins seldom act as single isolated species in the performance of their functions; rather, proteins involved in the same cellular processes often interact with each other. Therefore, the functions of uncharacterized proteins can be predicted through comparison with the interactions of similar known proteins. A detailed examination of a protein-protein interaction network can thus yield significant new insights into protein functions.;The protein interaction network can be described as a complex system of proteins linked by interactions. Examples of complex systems include social networks, the World Wide Web and biological systems such as metabolic networks, gene regulatory networks and protein interaction networks. Recently, the studies of complex systems [4, 96] have attempted to understand and characterize the structural behaviors of the systems in a topological perspective. As interesting features, the small-world properties [147], scale-free degree distributions [14, 15] and hierarchical modularity [115] have been observed in complex systems. These unique characteristics can facilitate the efficient and accurate analysis of protein interaction networks.;The study in this dissertation is based on the theoretical and quantitative characterization of protein interaction networks. In particular, it focuses on the efficient and accurate analysis of protein functionality through protein interaction networks. As a preliminary work of this dissertation, the reliabilities of protein-protein interactions are assessed using the topological features of the network. A novel measurement to quantify the interaction is proposed. In the next step, the weighted interaction networks of proteins, which are created by assigning the reliability to each edge as a weight, are used to detect functional modules and predict protein functions. Novel algorithms, based on information flow, spanning tree, and neural networks are proposed. This study can be underlying bases for functional characterization of proteins and also provides a guideline for the deeper analysis of biological networks such as network inference and network dynamics.
机译:自从人类基因组测序获得了成果[142,66]以来,遗传学领域就处于重要的理论和实践进展的门槛。深入开展这些研究的关键是对生物编码的蛋白质的表达,功能和调控的全面理解[155]。这种理解是蛋白质组学学科的主题。蛋白质组学涵盖了广泛的方法和应用,旨在阐明复杂的生物学过程如何在分子水平上发生,它们在各种细胞类型中如何不同以及在疾病状态下如何发生变化。特别是,已经阐明了蛋白质功能仍然是计算生物学中最核心的问题之一。蛋白质是大分子,充当细胞的组成部分和功能组件,占细胞重量中仅次于水的第二大部分。蛋白质负责生物体中一些最重要的功能,例如器官的组成(结构蛋白),催化代谢所需的生化反应(酶)和维持细胞环境(跨膜蛋白)。因此,蛋白质是生命中最重要,用途最广泛的大分子,其功能知识是开发新药,改良农作物甚至发展合成生物化学物质(如生物燃料)的关键环节[49]。综述指出,目前测序的完整基因组的很大一部分其基因条目中至少有一半具有模糊注解[59]。预测蛋白质功能的经典方法是使用序列相似性算法(例如FASTA [106]和PSI-BLAST [85])在未注释的蛋白质和其他蛋白质之间寻找同源性。然后可以根据具有相似序列的带注释的蛋白质分配未带注释的蛋白质的功能。此外,基于相关的基因进化机制,提出了几种计算方法。例如,域融合分析推断一对蛋白质相互作用,从而执行相关功能[88]。近年来,高通量生物技术为推断蛋白质功能提供了更多机会。通过高通量实验(包括酵母双杂交分析[67] [139],质谱[50] [62]和合成致死率筛查[3])进行的高通量实验得到了丰富的蛋白质-蛋白质相互作用(PPI)数据,提供了重要的线索蛋白质之间的功能关联。已经观察到蛋白质在其功能执行中很少充当单个分离的物种。相反,参与同一细胞过程的蛋白质通常会相互影响。因此,可以通过与相似的已知蛋白质的相互作用进行比较来预测未表征蛋白质的功能。因此,对蛋白质-蛋白质相互作用网络的详细检查可以对蛋白质功能产生重要的新见解。蛋白质相互作用网络可以描述为由相互作用连接的复杂蛋白质系统。复杂系统的示例包括社交网络,万维网以及诸如代谢网络,基因调控网络和蛋白质相互作用网络等生物系统。最近,对复杂系统的研究[4,96]试图从拓扑的角度理解和表征系统的结构行为。作为有趣的功能,在复杂系统中已观察到小世界属性[147],无标度分布[14、15]和分层模块化[115]。这些独特的特征可以促进蛋白质相互作用网络的高效和准确分析。本文的研究基于蛋白质相互作用网络的理论和定量表征。特别是,它着重于通过蛋白质相互作用网络对蛋白质功能进行高效,准确的分析。作为本论文的初步工作,利用网络的拓扑特征来评估蛋白质相互作用的可靠性。提出了一种新颖的方法来量化相互作用。在下一步中,通过将可靠性分配给每个边作为权重来创建蛋白质的加权交互网络,以检测功能模块并预测蛋白质功能。提出了一种基于信息流,生成树和神经网络的新算法。这项研究可以作为蛋白质功能表征的基础,也可以为深入分析生物网络(如网络推断和网络动力学)提供指导。

著录项

  • 作者

    Shi, Lei.;

  • 作者单位

    State University of New York at Buffalo.;

  • 授予单位 State University of New York at Buffalo.;
  • 学科 Biology Bioinformatics.;Computer Science.
  • 学位 Ph.D.
  • 年度 2011
  • 页码 194 p.
  • 总页数 194
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号