首页> 外文会议>2010 International Conference on Bioinformatics and Biomedical Technology (ICBBT 2010) >Homology based Multi-instance Kernel combination for Gram-negative protein subcelluar localization
【24h】

Homology based Multi-instance Kernel combination for Gram-negative protein subcelluar localization

机译:基于同源性的多实例核组合用于革兰氏阴性蛋白亚细胞定位

获取原文
获取原文并翻译 | 示例

摘要

Previous computational models generally exclude homology out of the training set to reduce potential predictive bias. This paper proposes a hierarchical kernel to incorporate homology for more accurate similarity definition between two protein sequences. Metaphorized as the scenario of multi-instance learning, a homologous sequence is viewed as one evolutionary instance of the target sequence and all the homologous sequences constitute one homology bag. The bottom-level kernel is defined as k-mer spectrum kernel to define the similarity between any two instances; the middle-level multi-instance kernel is defined as the sum of all the spectrum kernels, actually the similarity definition between two homology bags, called Homology-based Multi-instance Kernel (HoMIKernel). By varying k-mer size and compressing 20 amino acids, we can derive multiple HoMIKernels, which are further combined into the top-level kernel called HoMIKernel+ to capture more contextual information and cover size-varying motifs. We evaluate HoMIKernel+ on Gram-negative benchmark dataset. The experiments show that HoMIKernel+ achieves better predictive performance than the baseline models and the incorporation of homologous sequences does increase the predictive performance.
机译:先前的计算模型通常将同源性排除在训练集中之外,以减少潜在的预测偏差。本文提出了一个层次核,将同源性纳入两个蛋白质序列之间的更准确的相似性定义。隐喻作为多实例学习的场景,同源序列被视为目标序列的一个进化实例,所有同源序列构成一个同源袋。底层内核定义为k-mer光谱内核,以定义任意两个实例之间的相似性;中层多实例内核被定义为所有频谱内核的总和,实际上是两个同源袋之间的相似性定义,称为基于同源性的多实例内核(HoMIKernel)。通过改变k-mer大小并压缩20个氨基酸,我们可以得到多个HoMIKernels,它们进一步组合到称为HoMIKernel +的顶级内核中,以捕获更多上下文信息并覆盖大小可变的基序。我们在革兰氏阴性基准数据集上评估HoMIKernel +。实验表明,与基线模型相比,HoMIKernel +具有更好的预测性能,而同源序列的引入确实提高了预测性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号