首页> 外文会议>International Conference on Communication Technology and System Design >Extraction of Motif Patterns from Protein Sequence Using Rough- K-Means Algorithm
【24h】

Extraction of Motif Patterns from Protein Sequence Using Rough- K-Means Algorithm

机译:使用粗糙-K型算法从蛋白质序列中提取基序模式

获取原文
获取外文期刊封面目录资料

摘要

Bioinformatics is the application of computer technology to the management of biological information. In Bioinformatics, Motif finding is one of the most popular problems, which has many applications. It is the process of locating the meaningful patterns in the sequence of Deoxyribo Nucleic Acid (DNA), Ribo Nucleic Acid (RNA) or Proteins. Motifs vary in lengths, positions, redundancy, orientation and bases. Finding these short sequences (motifs or signals) is a fundamental problem in molecular biology and computer science with important applications such as knowledge-based drug design, forensic DNA analysis, and agricultural biotechnology. In this work, the clustering system is used to predict local protein sequence Motifs. Since clustering algorithms can provide an automatic, unsupervised discovery process for sequence motifs, the K-Means clustering algorithm and Rough-K-means algorithm proposed are chosen as the motif discovery method for this study and the results are compared. The structural similarity of the clusters discovered by the proposed approach is studied to analyze how the recurring patterns correlate with its structure. Also, some biochemical references are included in our evaluation.
机译:生物信息学是计算机技术在生物信息管理中的应用。在生物信息学中,图案发现是最受欢迎的问题之一,具有许多应用。它是在脱氧核酸核酸(DNA),核核酸(RNA)或蛋白质的序列中定位有意义的图案的过程。主题在长度,位置,冗余,方向和基础中变化。寻找这些短序列(图案或信号)是分子生物学和计算机科学的基本问题,具有基于知识的药物设计,法医DNA分析和农业生物技术等重要应用。在这项工作中,聚类系统用于预测局部蛋白质序列图案。由于聚类算法可以为序列图案提供自动,无监督的发现过程,因此提出的K-Means聚类算法和粗糙-K平均算法作为本研究的基序发现方法,并比较结果。研究了所提出的方法发现的集群的结构相似性,分析了复发模式如何与其结构相关。此外,我们的评估中包含一些生化参考文献。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号