首页> 外文期刊>Biomedical signal processing and control >A novel entropy-based mapping method for determining the protein-protein interactions in viral genomes by using coevolution analysis
【24h】

A novel entropy-based mapping method for determining the protein-protein interactions in viral genomes by using coevolution analysis

机译:一种基于新的基于熵的测绘方法,用于使用共聚分析测定病毒基因组中的蛋白质 - 蛋白质相互作用

获取原文
获取原文并翻译 | 示例

摘要

Protein-protein interactions have a vital role in DNA transcription, immune system, and signal transmission between cells. Determining the interactions between proteins can give information about the functional structure of a cell and the functions of target organisms. Protein-protein interactions are determined by experimental approaches, yet, there is still a huge gap in specifying all possible protein interactions in an organism. Furthermore, since these approaches use cloning, labeling, and affinity mass spectrometry, the analysis process is time-consuming and expensive. However, analyzing the protein interactions with computational approaches based on coevolution theory eliminate these kinds of limitations, since in the coevolution theory model, interacting proteins show coevolutionary mutations and form similar phylogenetic trees. Current coevolution methods are based on the multiple-sequence alignment process; yet many high false positive interactions arise with these methods. Therefore, it is important to perform computational-based coevolution analysis. Protein-protein interaction using coevolution analysis has been employed in conjunction with experimental approaches to explore new protein interactions. However, in order to predict protein interactions with computational-based coevolution analysis, protein sequences need to be mapped. There are various types of protein mapping methods belonging to certain categories in the literature. These methods are frequently used in studies of predicting protein interactions. In this study, as an alternative to these methods, we proposed a novel entropy-based protein mapping method and predicted protein-protein interactions in viral genomes by using coevolution analysis. The study consists of 5 stages. In the first stage, the protein sequences of viral genomes were mapped using both the proposed numerical mapping method and state-of-arts protein mapping methods. In the second stage, Fourier transform was applied to each mapped protein sequences. In the third stage, the distance matrix was generated by finding the distances between the proteins belonging to the same virus genome. In the fourth stage, Pearson correlation values between the distances were calculated and coevolution analysis was performed. In the last stage, the proposed mapping method was compared with state-of-arts protein mapping methods and MirrorTree approach. Coevolution analysis was performed on two different virus genomes; Ebola virus and Influenza A virus. With the proposed method, a high degree of correlation has been obtained between proteins of the Ebola virus. For Ebola virus, the lowest correlation result (0.75) was obtained between the NP-VP35 protein pair. The highest correlation (0.99) was observed between the NP-VP24 and NP-VP40 protein pairs. For Influenza A, the lowest correlation (0.09) was obtained between the M1-PA(X) protein pair with the proposed method. The highest correlation value (0.98) with the proposed method was calculated between the M1-M2 protein pair. The proposed method verified the interactions between protein pairs, which have been experimentally proven, with a high degree correlation value. These results indicated that the proposed method can be effective in predicting protein interactions.
机译:蛋白质 - 蛋白质相互作用在DNA转录,免疫系统和细胞之间的信号传递中具有重要作用。确定蛋白质之间的相互作用可以提供有关细胞功能结构的信息和靶生物的功能。通过实验方法确定蛋白质 - 蛋白质相互作用然而,在指定生物体中的所有可能的蛋白质相互作用时仍存在巨大差异。此外,由于这些方法使用克隆,标记和亲和力质谱,因此分析过程是耗时和昂贵的。然而,分析基于共区理论的计算方法的蛋白质相互作用消除了这些局限性,因为在共区面理论模型中,相互作用蛋白显示共脉冲突变并形成类似的系统发育树。当前的共区方法基于多序列对准过程;然而,这些方法出现了许多高误阳性相互作用。因此,重要的是要执行基于计算的共同区分分析。使用共谱分析的蛋白质 - 蛋白质相互作用与实验方法一起使用以探索新的蛋白质相互作用。然而,为了预测与基于计算的共同分析的蛋白质相互作用,需要映射蛋白质序列。有各种类型的蛋白质映射方法属于文献中某些类别。这些方法经常用于预测蛋白质相互作用的研究。在本研究中,作为这些方法的替代方案,我们提出了一种新的基于熵的蛋白质作图方法和通过参数分析通过参数分析预测病毒基因组的蛋白质 - 蛋白质相互作用。该研究由5个阶段组成。在第一阶段,使用所提出的数值映射方法和最先进的蛋白质映射方法映射病毒基因组的蛋白质序列。在第二阶段,将傅里叶变换应用于每个映射的蛋白质序列。在第三阶段,通过在属于同一病毒基因组之间的蛋白质之间的距离来产生距离矩阵。在第四阶段,计算距离之间的Pearson相关值,并进行共同分析。在最后阶段,将所提出的映射方法与最先进的蛋白质映射方法和MistorREE方法进行比较。在两种不同的病毒基因组上进行共区分分析;埃博拉病毒和流感病毒。利用所提出的方法,在埃博拉病毒的蛋白质之间获得了高度的相关性。对于埃博拉病毒,在NP-VP35蛋白对之间获得最低相关结果(0.75)。在NP-VP24和NP-VP40蛋白对之间观察到最高的相关性(0.99)。对于流感A,与所提出的方法在M1-PA(X)蛋白对之间获得最低的相关性(0.09)。在M1-M2蛋白对之间计算具有所提出的方法的最高相关值(0.98)。所提出的方法验证了蛋白质对之间的相互作用,该相对已经经过实验证明,具有高度的相关值。这些结果表明,所提出的方法可以有效地预测蛋白质相互作用。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号