首页> 外文期刊>Journal of Integrative Bioinformatics >Integrated simultaneous analysis of different biomedical data types with exact weighted bi-cluster editing
【24h】

Integrated simultaneous analysis of different biomedical data types with exact weighted bi-cluster editing

机译:具有精确加权双群集编辑的不同生物医学数据类型的综合同时分析

获取原文
           

摘要

Summary The explosion of biological data has largely influenced the focus of today’s biology research. Integrating and analysing large quantity of data to provide meaningful insights has become the main challenge to biologists and bioinformaticians. One major problem is the combined data analysis of data from different types, such as phenotypes and genotypes. This data is modelled as bi-partite graphs where nodes correspond to the different data points, mutations and diseases for instance, and weighted edges relate to associations between them. Bi-clustering is a special case of clustering designed for partitioning two different types of data simultaneously. We present a bi-clustering approach that solves the NP-hard weighted bi-cluster editing problem by transforming a given bi-partite graph into a disjoint union of bi-cliques. Here we contribute with an exact algorithm that is based on fixed-parameter tractability. We evaluated its performance on artificial graphs first. Afterwards we exemplarily applied our Java implementation to data of genome-wide association studies (GWAS) data aiming for discovering new, previously unobserved geno-to-pheno associations. We believe that our results will serve as guidelines for further wet lab investigations. Generally our software can be applied to any kind of data that can be modelled as bi-partite graphs. To our knowledge it is the fastest exact method for weighted bi-cluster editing problem.
机译:发明内容生物数据的爆炸在很大程度上影响了当今生物学研究的重点。整合和分析大量数据以提供有意义的见解已成为生物学家和生物信息管理员的主要挑战。一个主要问题是来自不同类型的数据的组合数据分析,例如表型和基因型。该数据被建模为Bi-Partione图,其中节点对应于例如不同的数据点,突变和疾病,而加权边缘与它们之间的关联有关。双群集是群集的特殊情况,用于同时分隔两种不同类型的数据。我们介绍了一种双聚类方法,通过将给定的双脚部图转换为双群的不相交联盟来解决NP硬加权双簇编辑问题。在这里,我们用基于固定参数途径的精确算法贡献。我们首先评估其在人造图中的性能。之后,我们示例性地将我们的Java实施应用于基因组 - 范围协会研究(GWAS)数据的数据,旨在发现新的,以前未观察到的基因对苯对卵对关联。我们相信,我们的结果将作为进一步潮湿实验室调查的指导方针。一般来说,我们的软件可以应用于任何可以建模为Bi-Partite图形的任何类型的数据。据我们所知,它是加权双群集编辑问题的最快精确方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号