Motif discovery for proteins using subsequence clustering


Abstract

We propose an algorithm that discovers motifs by clustering subsequences. In our previous approach, we guided motif discovery by sampling subsequences and feeding them to MEME, an existing motif discovery tool. In this paper, we show that clustering subsequences can also detect motifs without relying on other motif discovery tools. Motif discovery algorithms generally perform poorly when the input set consists of non-homogeneous sequences, whereas clustering tools can inherently group non-homogeneous input into clusters of homogeneous sequences. We therefore use our clustering algorithm to generate aligned subsequence clusters and rank them by information content to produce the final motifs. The algorithm was tested on the PROSITE database, and the results suggest that it is very effective at finding motifs even when the input sequences come from different protein families.
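The ranking step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not state the scoring formula, so the sketch assumes the common per-column Shannon information content over a 20-letter amino acid alphabet with a uniform background and no small-sample correction. The function names and the toy clusters are hypothetical.

```python
import math
from collections import Counter

ALPHABET_SIZE = 20  # assumption: the 20 standard amino acids, uniform background


def column_information(column):
    """Information content (bits) of one aligned column: log2(20) - entropy."""
    counts = Counter(column)
    total = len(column)
    entropy = -sum((n / total) * math.log2(n / total) for n in counts.values())
    return math.log2(ALPHABET_SIZE) - entropy


def cluster_information(cluster):
    """Sum of column information over an ungapped, equal-length cluster."""
    width = len(cluster[0])
    if any(len(s) != width for s in cluster):
        raise ValueError("all subsequences in a cluster must have equal length")
    return sum(column_information([s[i] for s in cluster]) for i in range(width))


def rank_clusters(clusters):
    """Order clusters from most to least conserved."""
    return sorted(clusters, key=cluster_information, reverse=True)


# Toy aligned subsequence clusters (hypothetical data).
clusters = [
    ["ACDE", "ACDE", "ACDE"],  # every column conserved
    ["ACDE", "GHIK", "LMNP"],  # no column conserved
    ["ACDE", "ACDF", "ACDG"],  # three of four columns conserved
]
ranked = rank_clusters(clusters)
for cluster in ranked:
    print(cluster, round(cluster_information(cluster), 2))
```

Under these assumptions the fully conserved cluster ranks first and the unconserved one last. A real scorer would likely also account for cluster size and background amino acid frequencies, which this sketch ignores.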


Bibliographic record

Source: Proceedings of the 5th international workshop on Bioinformatics | 2005 | pp. 3-6 | 4 pages
Conference location: Chicago, IL (US)
Authors: Hardik A. Sheth; Sun Kim
Affiliation: Indiana University, Bloomington, IN
Original format: PDF
Language: English
CLC classification: Computing technology, computer technology
Keywords: subsequences
