首页> 外文会议>International Conference on Advanced Electronic Materials, Computers and Software Engineering >Human protein subcellular localization prediction based on error correcting output coding strategy while combining immunohistochemistry image and amino acid sequence
【24h】

Human protein subcellular localization prediction based on error correcting output coding strategy while combining immunohistochemistry image and amino acid sequence

机译:基于误差校正输出编码策略的人蛋白质亚细胞定位预测,同时组合免疫组织化学图像和氨基酸序列

获取原文

摘要

Human protein subcellular localization prediction is essential for understanding function of proteins. Only when the proteins arrive at the right place at the right time can they perform their functions. In the past decades, various methods based on machine learning have been proposed to predict the subcellular localization of proteins. However, a common drawback is that most of these methods only focus on single signal of proteins, such as amino acid sequences or protein images. In this paper, a novel prediction model based on error correcting output coding (ECOC) strategy was proposed, which named ECOC-locator. It has two advantages, one is that ECOC-locator takes full advantage of amino acid sequence information, for example, rich context information of amino acid sequences, and takes interpretability of protein images into account. The other is that ECOC-locator balances the class contributions of the two signals to the prediction model by employing ECOC. The ensemble classifier integrated 8 classifiers, five of which were trained by immunohistochemistry (IHC) images and the rest trained by amino acid sequence data. The experimental results show that the accuracy of ECOC-locator can reach 40.34%, which fully demonstrates the effectiveness.
机译:人蛋白亚细胞定位预测对于了解蛋白质的功能是必不可少的。只有当蛋白质在正确的时间到达正确的地方时,他们才能执行它们的功能。在过去的几十年中,已经提出了基于机器学习的各种方法来预测蛋白质的亚细胞定位。然而,共同的缺点是大多数这些方法仅关注蛋白质的单个信号,例如氨基酸序列或蛋白质图像。本文提出了一种基于纠错输出编码(ECOC)策略的新型预测模型,其命名为ECOC定位器。它具有两个优点,一个是ecoc定位器充分利用氨基酸序列信息,例如氨基酸序列的丰富的上下文信息,并考虑蛋白质图像的解释性。另一种是,Ecoc定位器通过使用EcoC来平衡两个信号的课程到预测模型。集合分类器集成了8分类器,其中五种分类器由免疫组织化学(IHC)图像训练,并且氨基酸序列数据训练的其余培训。实验结果表明,ecoC定位器的准确性可以达到40.34%,这完全证明了效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号