Automatic generation of primary sequence patterns from sets of related protein sequences.

Abstract

We have developed a computer algorithm that can extract the pattern of conserved primary sequence elements common to all members of a homologous protein family. The method involves clustering the pairwise similarity scores among a set of related sequences to generate a binary dendrogram (tree). The tree is then reduced in a stepwise manner by progressively replacing the node connecting the two most similar termini by one common pattern until only a single common "root" pattern remains. A pattern is generated at a node by (i) performing a local optimal alignment on the sequence/pattern pair connected by the node with the use of an extended dynamic programming algorithm and then (ii) constructing a single common pattern from this alignment with a nested hierarchy of amino acid classes to identify the minimal inclusive amino acid class covering each paired set of elements in the alignment. Gaps within an alignment are created and/or extended using a "pay once" gap penalty rule, and gapped positions are converted into gap characters that function as 0 or 1 amino acid of any type during subsequent alignment. This method has been used to generate a library of covering patterns for homologous families in the National Biomedical Research Foundation/Protein Identification Resource protein sequence data base. We show that a covering pattern can be more diagnostic for sequence family membership than any of the individual sequences used to construct the pattern.
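The pattern-construction step in (ii) can be illustrated with a minimal Python sketch. It assumes an alignment has already been computed, and it uses a hypothetical nested class hierarchy (`CLASSES`), gap symbols (`ALIGN_GAP`, `GAP`), and helper names (`min_cover`, `merge_alignment`, `to_regex`) chosen only for illustration; none of these are taken from the paper. The regular-expression conversion is just a convenient way to show a gap character acting as 0 or 1 residue of any type, not the dynamic programming alignment the abstract describes.

```python
import re

RESIDUES = "ACDEFGHIKLMNPQRSTVWY"

# Hypothetical nested hierarchy (an assumption, NOT the classes used in the paper).
# Any two classes are either disjoint or one contains the other.
CLASSES = {
    "a": frozenset("ILVM"),       # aliphatic
    "r": frozenset("FWY"),        # aromatic
    "h": frozenset("ILVMFWY"),    # hydrophobic, contains a and r
    "b": frozenset("KRH"),        # basic
    "n": frozenset("DE"),         # acidic
    "c": frozenset("KRHDE"),      # charged, contains b and n
    "p": frozenset("KRHDENQST"),  # polar, contains c
    "X": frozenset(RESIDUES),     # any residue
}
ALIGN_GAP = "-"  # gap symbol in the input alignment (assumed convention)
GAP = "g"        # pattern gap character: 0 or 1 residue of any type


def members(symbol):
    """Residues covered by a pattern symbol (a class name or a single residue)."""
    return CLASSES.get(symbol, frozenset(symbol))


def min_cover(x, y):
    """Smallest class in the hierarchy that covers both symbols."""
    if x == y:
        return x
    needed = members(x) | members(y)
    covering = [(len(s), name) for name, s in CLASSES.items() if needed <= s]
    return min(covering)[1]


def merge_alignment(top, bottom):
    """Collapse two aligned rows (sequences or patterns) into one common pattern."""
    if len(top) != len(bottom):
        raise ValueError("aligned rows must have equal length")
    out = []
    for x, y in zip(top, bottom):
        if ALIGN_GAP in (x, y) or GAP in (x, y):
            out.append(GAP)  # gapped positions become gap characters
        else:
            out.append(min_cover(x, y))
    return "".join(out)


def to_regex(pattern):
    """Translate a pattern into a regular expression for a simple membership check."""
    parts = []
    for s in pattern:
        if s == GAP:
            parts.append(".?")  # zero or one residue of any type
        elif s in CLASSES:
            parts.append("[" + "".join(sorted(CLASSES[s])) + "]")
        else:
            parts.append(s)
    return "".join(parts)


# Two aligned sequences give a pattern; merging in a third mimics one tree-reduction step.
pattern = merge_alignment("KIVD-W", "RLVEAF")  # "baVngr"
root = merge_alignment(pattern, "HFVD-Y")      # "bhVngr"
```

In this toy example, `re.fullmatch(to_regex(root), seq)` succeeds for each of the three ungapped input sequences (`KIVDW`, `RLVEAF`, `HFVDY`), which is the sense in which the root pattern "covers" the family. A real hierarchy and scoring scheme would need to come from the paper itself.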

Bibliographic details

Journal: Proceedings of the National Academy of Sciences of the United States of America
Authors: R F Smith; T F Smith
Year (volume), issue: 1990 (87), 1
Pages: 118–122
Total pages: 5
Original format: PDF

Similar literature

1. Identifying DNA and protein patterns with statistically significant alignments of multiple sequences. [J]. Hertz GZ, Stormo GD. Bioinformatics. 1999, issue 7-8.

2. Classification of G-protein coupled receptors by alignment-independent extraction of principal chemical properties of primary amino acid sequences. [J]. Lapinsh M, Gutcaits A, Prusis P, Protein Science: A Publication of the Protein Society. 2002, issue 4.

3. Serum antibodies from patients with primary Sjogren's syndrome and systemic lupus erythematosus recognize multiple epitopes on the La(SS-B) autoantigen resembling viral protein sequences. [J]. Haaheim LR, Halse AK, Kvakestad R, Scandinavian Journal of Immunology. 1996, issue 1.

4. Prediction of Helix, Strand Segments from Primary Protein Sequences by a Set of Neural Networks. [C]. Zhuo Song, Ning Zhang, Zhuo Yang, Advances in Neural Networks - ISNN 2007 pt.2; Lecture Notes in Computer Science; 4492. 2007.

5. Cellular pattern quantification and automatic bench-marking data-set generation on confocal microscopy images. [D]. Cui, Chi. 2010.

6. Finding flexible patterns in unaligned protein sequences. [O]. I. Jonassen, J. F. Collins, D. G. Higgins. 1995.

7. Automatic generation of primary sequence patterns from sets of related protein sequences. [O]. Smith, R F, Smith, T F. 1990.
