BMC Bioinformatics

Application of a sensitive collection heuristic for very large protein families: Evolutionary relationship between adipose triglyceride lipase (ATGL) and classic mammalian lipases



Abstract

Background: Manually finding subtle yet statistically significant links to distantly related homologues becomes practically impossible for very populated protein families due to the sheer number of similarity searches to be invoked and analyzed. The unclear evolutionary relationship between classical mammalian lipases and the recently discovered human adipose triglyceride lipase (ATGL; a patatin family member) is an exemplary case for such a problem.

Results: We describe an unsupervised, sensitive sequence segment collection heuristic suitable for assembling very large protein families. It is based on fan-like expanding, iterative database searches. To prevent inclusion of unrelated hits, additional criteria are introduced: minimal alignment length and overlap with starting sequence segments, finding starting sequences in reciprocal searches, and automated filtering for compositional bias and repetitive patterns. This heuristic was implemented as FAMILYSEARCHER in the ANNIE sequence analysis environment and applied to search for protein links between the classical lipase family and the patatin-like group.

Conclusion: The FAMILYSEARCHER is an efficient tool for tracing distant evolutionary relationships involving large protein families. Although classical lipases and ATGL have no obvious sequence similarity and differ with regard to fold and catalytic mechanism, homology links detected with FAMILYSEARCHER show that they are evolutionarily related. The conserved sequence parts can be narrowed down to an ancestral core module consisting of three β-strands, one α-helix and a turn containing the typical nucleophilic serine. Moreover, this ancestral module also appears in numerous enzymes that have various substrate specificities but rely critically on nucleophilic attack mechanisms.
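The collection heuristic described in the Results can be illustrated with a minimal sketch. This is not the FAMILYSEARCHER implementation: the names `Hit`, `collect_family` and `toy_search`, the coordinate convention, and the threshold values are all hypothetical, and the database search is replaced by a small in-memory lookup. The sketch covers only three of the listed criteria (minimal alignment length, overlap with the starting segment, and the reciprocal-search check); filtering for compositional bias and repetitive patterns is omitted.

```python
from collections import deque
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List, Tuple


@dataclass(frozen=True)
class Hit:
    """One alignment; coordinates are 0-based, end-exclusive (assumed convention)."""
    subject_id: str
    query_start: int
    query_end: int
    subject_start: int
    subject_end: int

    @property
    def length(self) -> int:
        return self.query_end - self.query_start


SearchFn = Callable[[str], Iterable[Hit]]  # hypothetical stand-in for a database search


def overlap(a_start: int, a_end: int, b_start: int, b_end: int) -> int:
    """Number of shared positions between two intervals (negative if disjoint)."""
    return min(a_end, b_end) - max(a_start, b_start)


def collect_family(seeds: Iterable[Tuple[str, int, int]], search: SearchFn,
                   min_len: int = 40, min_overlap: int = 20,
                   max_rounds: int = 5) -> Dict[str, Tuple[int, int]]:
    """Fan-like expansion: every accepted hit segment becomes a new query."""
    family: Dict[str, Tuple[int, int]] = {}
    queue = deque()
    for seq_id, start, end in seeds:
        family[seq_id] = (start, end)
        queue.append((seq_id, start, end, 0))
    while queue:
        q_id, q_start, q_end, depth = queue.popleft()
        if depth >= max_rounds:
            continue
        for hit in search(q_id):
            if hit.subject_id in family:
                continue
            if hit.length < min_len:  # minimal alignment length
                continue
            if overlap(hit.query_start, hit.query_end, q_start, q_end) < min_overlap:
                continue  # alignment must overlap the starting segment
            # Reciprocal check: searching with the hit must find the query again.
            if not any(back.subject_id == q_id for back in search(hit.subject_id)):
                continue
            family[hit.subject_id] = (hit.subject_start, hit.subject_end)
            queue.append((hit.subject_id, hit.subject_start, hit.subject_end, depth + 1))
    return family


# Toy "database": precomputed hits per query (purely illustrative data).
TOY_HITS: Dict[str, List[Hit]] = {
    "A": [Hit("B", 0, 60, 10, 70),    # accepted
          Hit("X", 0, 10, 0, 10),     # too short
          Hit("Y", 100, 160, 0, 60),  # outside the starting segment
          Hit("Z", 0, 60, 0, 60)],    # fails the reciprocal check
    "B": [Hit("A", 10, 70, 0, 60), Hit("C", 20, 70, 0, 50)],
    "C": [Hit("B", 0, 50, 20, 70)],
    "X": [Hit("A", 0, 10, 0, 10)],
    "Y": [Hit("A", 0, 60, 100, 160)],
}


def toy_search(seq_id: str) -> List[Hit]:
    return TOY_HITS.get(seq_id, [])


if __name__ == "__main__":
    print(collect_family([("A", 0, 80)], toy_search))
```

In this toy run, sequence C is reached only through B, which is the point of the fan-like expansion: a homologue that the seed does not detect directly can still be collected through an intermediate family member, while the extra criteria keep out hits such as X, Y and Z.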
