Journal: Nucleic Acids Research

SALAD database: a motif-based database of protein annotations for plant comparative genomics

Abstract

Proteins often have several motifs with distinct evolutionary histories. Proteins with similar motifs have similar biochemical properties and thus related biological functions. We constructed a unique comparative genomics database termed the SALAD database () from plant-genome-based proteome data sets. We extracted evolutionarily conserved motifs by MEME software from 209 529 protein-sequence annotation groups selected by BLASTP from the proteome data sets of 10 species: rice, sorghum, Arabidopsis thaliana, grape, a lycophyte, a moss, 3 algae, and yeast. Similarity clustering of each protein group was performed by pairwise scoring of the motif patterns of the sequences. The SALAD database provides a user-friendly graphical viewer that displays a motif pattern diagram linked to the resulting bootstrapped dendrogram for each protein group. Amino-acid-sequence-based and nucleotide-sequence-based phylogenetic trees for motif combination alignment, a logo comparison diagram for each clade in the tree, and a Pfam-domain pattern diagram are also available. We also developed a viewer named ‘SALAD on ARRAYs’ to view arbitrary microarray data sets of paralogous genes linked to the same dendrogram in a window. The SALAD database is a powerful tool for comparing protein sequences and can provide valuable hints for biological analysis.
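
The clustering step described in the abstract (pairwise scoring of motif patterns, then a dendrogram per protein group) can be illustrated with a small sketch. The abstract does not give the scoring function, so the Jaccard similarity used here is an assumption chosen for simplicity, not the SALAD method, and the protein names and motif identifiers are hypothetical. Bootstrapping of the dendrogram is not shown.

```python
from itertools import combinations


def motif_similarity(a, b):
    """Jaccard similarity between two motif sets (an assumed scoring scheme)."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def pairwise_distances(patterns):
    """Map each unordered pair of protein names to 1 - similarity."""
    return {
        frozenset((p, q)): 1.0 - motif_similarity(patterns[p], patterns[q])
        for p, q in combinations(patterns, 2)
    }


def average_linkage(patterns):
    """Greedy average-linkage clustering; returns a nested-tuple dendrogram."""
    dist = pairwise_distances(patterns)
    # Each cluster is (tree, members); a leaf's tree is its own name.
    clusters = [(name, (name,)) for name in patterns]

    def cluster_distance(c1, c2):
        # Mean distance over all cross-cluster protein pairs.
        pairs = [(x, y) for x in c1[1] for y in c2[1]]
        return sum(dist[frozenset(pair)] for pair in pairs) / len(pairs)

    while len(clusters) > 1:
        # Merge the two closest clusters.
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda ij: cluster_distance(clusters[ij[0]], clusters[ij[1]]),
        )
        merged = ((clusters[i][0], clusters[j][0]), clusters[i][1] + clusters[j][1])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return clusters[0][0]


# Hypothetical motif patterns: protein name -> motif identifiers from a motif finder.
patterns = {
    "protA": ["m1", "m2", "m3"],
    "protB": ["m1", "m2"],
    "protC": ["m4", "m5"],
    "protD": ["m4", "m5", "m6"],
}
tree = average_linkage(patterns)
```

With these toy inputs, proteins sharing motifs are grouped first, so `tree` pairs protA with protB and protC with protD before joining the two pairs. A scoring scheme that also accounts for motif order or position would change only `motif_similarity`.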