...
首页> 外文期刊>Bioinformatics >Efficient remote homology detection using local structure.
【24h】

Efficient remote homology detection using local structure.

机译:使用本地结构进行有效的远程同源性检测。

获取原文
获取原文并翻译 | 示例
           

摘要

MOTIVATION: The function of an unknown biological sequence can often be accurately inferred if we are able to map this unknown sequence to its corresponding homologous family. At present, discriminative methods such as SVM-Fisher and SVM-pairwise, which combine support vector machine (SVM) and sequence similarity, are recognized as the most accurate methods, with SVM-pairwise being the most accurate. However, these methods typically encode sequence information into their feature vectors and ignore the structure information. They are also computationally inefficient. Based on these observations, we present an alternative method for SVM-based protein classification. Our proposed method, SVM-I-sites, utilizes structure similarity for remote homology detection. RESULT: We run experiments on the Structural Classification of Proteins 1.53 data set. The results show that SVM-I-sites is more efficient than SVM-pairwise. Further, we find that SVM-I-sites outperforms sequence-based methods such as PSI-BLAST, SAM, and SVM-Fisher while achieving a comparable performance with SVM-pairwise. AVAILABILITY: I-sites server is accessible through the web at http://www.bioinfo.rpi.edu. Programs are available upon request for academics. Licensing agreements are available for commercial interests. The framework of encoding local structure into feature vector is available upon request.
机译:动机:如果我们能够将未知序列映射到其相应的同源家族,通常可以准确地推断出未知生物学序列的功能。目前,将支持向量机(SVM)和序列相似性相结合的区分方法(例如SVM-Fisher和SVM-成对)被认为是最准确的方法,其中SVM-成对是最准确的。但是,这些方法通常将序列信息编码到其特征向量中,而忽略结构信息。它们在计算上也效率低下。基于这些观察,我们提出了一种基于SVM的蛋白质分类的替代方法。我们提出的方法SVM-I-sites利用结构相似性进行远程同源性检测。结果:我们对蛋白质的结构分类1.53数据集进行实验。结果表明,SVM-I站点比SVM-pairwise更有效。此外,我们发现SVM-I站点优于基于序列的方法(例如PSI-BLAST,SAM和SVM-Fisher),同时可与SVM-pairwise实现可比的性能。可用性:I网站服务器可通过以下网站访问:http://www.bioinfo.rpi.edu。可应要求为学者提供课程。许可协议可用于商业利益。可根据要求提供将局部结构编码为特征向量的框架。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号