首页> 外文会议>International Conference on Intelligent Systems for Molecular Biology >Protein Family Classification using Sparse Markov Transducers
【24h】

Protein Family Classification using Sparse Markov Transducers

机译:使用稀疏马尔可夫传感器蛋白质分类

获取原文

摘要

In this paper we present a method for classifying proteins into families using sparse Markov transducers (SMTs). Sparse Markov transducers, similar to probabilistic suffix trees, estimate a probability distribution conditioned on an input sequence. SMTs generalize probabilistic suffix trees by allowing for wild-cards in the conditioning sequences. Because substitutions of amino acids are common in protein families, incorporating wild-cards into the model significantly improves classification performance. We present two models for building protein family classifiers using SMTs. We also present efficient data structures to improve the memory usage of the models. We evaluate SMTs by building protein family classifiers using the Pfam database and compare our results to previously published results.
机译:在本文中,我们介绍了一种使用稀疏马尔可夫传感器(SMT)将蛋白质分类为家族的方法。稀疏马尔可夫传感器,类似于概率后缀树,估计在输入序列上调节的概率分布。通过允许调节序列中的野卡揭示概率概率的后缀树。由于氨基酸的替代在蛋白质家族中常见,因此将野生卡掺入模型中显着提高了分类性能。我们为使用SMT提供两种模型来建立蛋白质分类器。我们还提出了高效的数据结构来提高模型的内存使用情况。我们通过使用PFAM数据库建立蛋白质分类器来评估SMT,并将我们的结果与以前公布的结果进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号