首页> 外文期刊>Methods: A Companion to Methods in Enzymology >Proteome compression via protein domain compositions
【24h】

Proteome compression via protein domain compositions

机译:通过蛋白质结构域组成的蛋白质组压缩

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we study domain compositions of proteins via compression of whole proteins in an organism for the sake of obtaining the entropy that the individual contains. We suppose that a protein is a multiset of domains. Since gene duplication and fusion have occurred through evolutionary processes, the same domains and the same compositions of domains appear in multiple proteins, which enables us to compress a proteome by using references to proteins for duplicated and fused proteins. Such a network with references to at most two proteins is modeled as a directed hypergraph. We propose a heuristic approach by combining the Edmonds algorithm and an integer linear programming, and apply our procedure to 14 proteomes of Dictyostelium discoideum, Escherichia coli, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Caenorhabditis elegans, Drosophila melanogaster, Arabidopsis thaliana, Oryza sativa, Danio rerio, Xenopus laevis, Gallus gallus, Mus musculus, Pan troglodytes, and Homo sapiens. The compressed size using both of duplication and fusion was smaller than that using only duplication, which suggests the importance of fusion events in evolution of a proteome.
机译:在本文中,为了获得个体所包含的熵,我们研究了通过压缩生物体中完整蛋白质的蛋白质的域组成。我们假设蛋白质是多个结构域。由于基因的复制和融合是通过进化过程发生的,因此相同的结构域和相同的结构域组合物会出现在多种蛋白质中,这使我们能够通过引用复制和融合蛋白质的蛋白质来压缩蛋白质组。最多引用两种蛋白质的此类网络被建模为有向超图。我们提出了一种结合埃德蒙兹算法和整数线性规划的启发式方法,并将我们的方法应用于盘状双歧杆菌,大肠埃希氏菌,酿酒酵母,裂殖酵母,线虫Caenorhabditis elegans,果蝇黑腹果蝇,拟南芥,非洲爪蟾(Xenopus laevis),鸡(Gallus gallus),小家鼠(Mus musculus),泛穴居人和智人。使用复制和融合的压缩大小均小于仅使用复制的压缩大小,这表明融合事件在蛋白质组进化中的重要性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号