首页> 外文会议>International Conference on Bioinformatics and Biomedical Technology >Homology Based Multi-instance Kernel Combination for Gram-negative Protein Subcelluar Localization
【24h】

Homology Based Multi-instance Kernel Combination for Gram-negative Protein Subcelluar Localization

机译:基于同源性的多实例核组合,用于革兰阴性蛋白质群定位

获取原文

摘要

Previous computational models generally exclude homology out of the training set to reduce potential predictive bias. This paper proposes a hierarchical kernel to incorporate homology for more accurate similarity definition between two protein sequences. Metaphorized as the scenario of multi-instance learning, a homologous sequence is viewed as one evolutionary instance of the target sequence and all the homologous sequences constitute one homology bag. The bottom-level kernel is defined as k-mer spectrum kernel to define the similarity between any two instances; the middle-level multi-instance kernel is defined as the sum of all the spectrum kernels, actually the similarity definition between two homology bags, called Homology-based Multi-instance Kernel (HoMIKernel). By varying k-mer size and compressing 20 amino acids, we can derive multiple HoMIKernels, which are further combined into the top-level kernel called HoMIKernel+ to capture more contextual information and cover size-varying motifs. We evaluate HoMIKernel+ on Gram-negative benchmark dataset. The experiments show that HoMIKernel+ achieves better predictive performance than the baseline models and the incorporation of homologous sequences does increase the predictive performance.
机译:以前的计算模型通常排除训练集的同源性,以减少潜在的预测偏差。本文提出了一种分层内核,可以在两个蛋白质序列之间掺入同源性以获得更准确的相似性定义。作为多实例学习的场景中喻,将同源序列视为靶序列的一个进化实例,并且所有同源序列构成一个同源袋。底层内核定义为k-mer频谱内核,以定义任何两个实例之间的相似性;中级多实例内核被定义为所有Spectrum内核的总和,实际上两个同源袋之间的相似性定义,称为基于同源的多实例内核(Homikernel)。通过改变k-mer大小并压缩20个氨基酸,我们可以导出多个蜂鸣形,它们进一步组合到称为homikernel +的顶级内核,以捕获更多的上下文信息和覆盖大小不同的图案。我们在革兰氏否定基准数据集上评估homikernel +。实验表明,Homikernel +实现了比基线模型更好的预测性能,并且同源序列的掺入确实增加了预测性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号