【24h】

A Time-Delayed Information-Theoretic Approach to the Reverse Engineering of Gene Regulatory Networks Using Apache Spark

机译:使用Apache Spark进行基因调控网络逆向工程的时延信息理论方法

获取原文
获取原文并翻译 | 示例

摘要

Elucidating gene regulatory networks (GRNs) is crucial to understand the inner workings of the cell and the complexity of gene interactions. To date, numerous algorithms have been developed to infer or reconstruct gene regulatory networks from expression data. However, as the number of identified genes increases and the complexity of their interactions is uncovered, networks and their regulatory mechanisms become cumbersome to test. Furthermore, prodding through experimental results requires an enormous amount of computation, resulting in slow data processing. Therefore, new approaches are needed to expeditiously analyze copious amounts of experimental data resulting from cellular GRNs. To meet this need, cloud computing is promising as reported in the literature. Here we present a new algorithm for reverse engineering (inferring) gene regulatory networks on a computer cluster in a cloud environment. The algorithm, implemented in Apache Spark, employs an information-theoretic approach to infer GRNs from time-series gene expression data. Experimental results show that our Spark program is much faster than an existing tool while achieving the same prediction accuracy.
机译:阐明基因调控网络(GRN)对于了解细胞的内部运作以及基因相互作用的复杂性至关重要。迄今为止,已经开发了许多算法来从表达数据推断或重建基因调控网络。但是,随着鉴定出的基因数量的增加和相互作用的复杂性被发现,网络及其调节机制变得难以测试。此外,通过实验结果进行探测需要大量的计算,从而导致数据处理缓慢。因此,需要新的方法来迅速分析由细胞GRN产生的大量实验数据。为了满足这种需求,如文献所报道的,云计算是有前途的。在这里,我们提出了一种用于在云环境中的计算机集群上逆向工程(推断)基因调控网络的新算法。该算法在Apache Spark中实现,它采用信息论方法从时序基因表达数据中推断出GRN。实验结果表明,我们的Spark程序在实现相同预测精度的同时,比现有工具快得多。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号