首页> 外文期刊>BMC Bioinformatics >Stability of building gene regulatory networks with sparse autoregressive models
【24h】

Stability of building gene regulatory networks with sparse autoregressive models

机译:用稀疏自回归模型构建基因调控网络的稳定性

获取原文
           

摘要

BackgroundBiological networks are constantly subjected to random perturbations, and efficient feedback and compensatory mechanisms exist to maintain their stability. There is an increased interest in building gene regulatory networks (GRNs) from temporal gene expression data because of their numerous applications in life sciences. However, because of the limited number of time points at which gene expressions can be gathered in practice, computational techniques of building GRN often lead to inaccuracies and instabilities. This paper investigates the stability of sparse auto-regressive models of building GRN from gene expression data.ResultsCriteria for evaluating the stability of estimating GRN structure are proposed. Thereby, stability of multivariate vector autoregressive (MVAR) methods - ridge, lasso, and elastic-net - of building GRN were studied by simulating temporal gene expression datasets on scale-free topologies as well as on real data gathered over Hela cell-cycle. Effects of the number of time points on the stability of constructing GRN are investigated. When the number of time points are relatively low compared to the size of network, both accuracy and stability are adversely affected. At least, the number of time points equal to the number of genes in the network are needed to achieve decent accuracy and stability of the networks. Our results on synthetic data indicate that the stability of lasso and elastic-net MVAR methods are comparable, and their accuracies are much higher than the ridge MVAR. As the size of the network grows, the number of time points required to achieve acceptable accuracy and stability are much less relative to the number of genes in the network. The effects of false negatives are easier to improve by increasing the number time points than those due to false positives. Application to HeLa cell-cycle gene expression dataset shows that biologically stable GRN can be obtained by introducing perturbations to the data.ConclusionsAccuracy and stability of building GRN are crucial for investigation of gene regulations. Sparse MVAR techniques such as lasso and elastic-net provide accurate and stable methods for building even GRN of small size. The effect of false negatives is corrected much easier with the increased number of time points than those due to false positives. With real data, we demonstrate how stable networks can be derived by introducing random perturbation to data.
机译:背景技术生物网络经常受到随机扰动,并且存在有效的反馈和补偿机制来维持其稳定性。由于它们在生命科学中的众多应用,因此越来越需要根据时态基因表达数据来构建基因调控网络(GRN)。但是,由于在实践中可以收集基因表达的时间点数量有限,因此构建GRN的计算技术通常会导致不准确和不稳定。本文利用基因表达数据研究了构建GRN的稀疏自回归模型的稳定性。结果提出了评价GRN结构稳定性的评价标准。因此,通过模拟无标度拓扑上的时间基因表达数据集以及在Hela细胞周期上收集的真实数据,研究了构建GRN的多元矢量自回归(MVAR)方法的稳定性(岭,套索和弹性网)。研究了时间点数对构建GRN稳定性的影响。当时间点的数量与网络的大小相比相对较低时,准确性和稳定性都会受到不利影响。至少需要时间点的数量等于网络中基因的数量,才能获得良好的网络准确性和稳定性。我们在综合数据上的结果表明,套索和弹性网MVAR方法的稳定性是可比的,它们的准确性比脊线MVAR高得多。随着网络规模的增长,达到可接受的准确性和稳定性所需的时间点数量相对于网络中的基因数量要少得多。通过增加时间点数,比由误报引起的负面影响更容易改善。在HeLa细胞周期基因表达数据集上的应用表明,通过对数据进行扰动可以获得生物学上稳定的GRN。结论构建GRN的准确性和稳定性对于研究基因调控至关重要。诸如套索和弹性网之类的稀疏MVAR技术为构建甚至小尺寸的GRN提供了准确而稳定的方法。与由于误报而导致的时间点数量增加相比,使用增加的时间点可以更容易地纠正误报的影响。利用真实数据,我们演示了如何通过对数据引入随机扰动来导出稳定的网络。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号