首页> 外文会议>IEEE International Conference on Computer Supported Cooperative Work in Design >Collaborative Data Analytics towards Prediction on Pathogen-Host Protein-Protein Interactions
【24h】

Collaborative Data Analytics towards Prediction on Pathogen-Host Protein-Protein Interactions

机译:对病原体宿主蛋白质 - 蛋白质相互作用预测的协同数据分析

获取原文

摘要

Nowadays more and more data are being sequenced and accumulated in system biology, which brings the data analytics researchers to a brand new era, namely "big data", to extract the inner relationship and knowledge from the huge amount of data. Bridging the gap between computational methodology and biology to accelerate the development of biology analytics has been a hot area. In this paper, we focus on these enormous amounts of data generated with the speedy development of high throughput technologies during the past decades, especially for protein-protein interactions, which are the critical molecular process in biology. Since pathogen-host protein-protein interactions are the major and basic problems for not only infectious diseases but also drug design, molecular level interactions between pathogen and host play very critical role for the study of infection mechanisms. In this paper, we built a basic framework for analyzing the specific problems about pathogen-host protein-protein interactions (PHPPI), meanwhile, we also presented the state-of-art deep learning method results on prediction of PHPPI comparing with other machine learning methods. Utilizing the evaluation methods, specifically by considering the high skewed imbalanced ratio and huge amount of data, we detailed the pipeline solution on both storing and learning for PHPPI. This work contributes as a basis for a further investigation of protein and protein-protein interactions, with the collaboration of data analytics results from the vast amount of data dispersedly available in biology literature.
机译:如今正在越来越多的数据在系统生物学中进行排序并累积,这将数据分析研究人员带到了一个全新的时代,即“大数据”,从大量数据中提取内部关系和知识。弥合计算方法和生物学之间的差距,以加速生物学分析的发展是一个热门区域。在本文中,我们专注于在过去几十年中快速发展的这些巨大的数据,特别是对于蛋白质 - 蛋白质相互作用,这是生物学中的临界分子过程。由于病原体 - 宿主蛋白质 - 蛋白质相互作用是不仅传染病而且药物设计,病原体与宿主之间的分子水平相互作用对感染机制的研究起到了非常关键的作用。在本文中,我们建立了一个基本的框架,用于分析有关病原体的宿主蛋白 - 蛋白相互作用的具体问题(PHPPI),同时,我们还提出了关于PHPPI预测的最先进技术的深学习方法的结果与其他机器学习比较方法。利用评估方法,具体地考虑到高偏斜的不平衡比和大量数据,我们详细介绍了对PHPPI的存储和学习的管道解决方案。这项工作有助于进一步调查蛋白质和蛋白质 - 蛋白质相互作用的基础,数据分析的协作是由在生物文献中分散的大量数据产生的数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号