首页> 外文期刊>Bioinformatics >Reconstructing the temporal ordering of biological samples using microarray data.
【24h】

Reconstructing the temporal ordering of biological samples using microarray data.

机译:使用微阵列数据重建生物样品的时间顺序。

获取原文
获取原文并翻译 | 示例
       

摘要

Motivation: Accurate time series for biological processes are difficult to estimate due to problems of synchronization, temporal sampling and rate heterogeneity. Methods are needed that can utilize multi-dimensional data, such as those resulting from DNA microarray experiments, in order to reconstruct time series from unordered or poorly ordered sets of observations. Results: We present a set of algorithms for estimating temporal orderings from unordered sets of sample elements. The techniques we describe are based on modifications of a minimum-spanning tree calculated from a weighted, undirected graph. We demonstrate the efficacy of our approach by applying these techniques to an artificial data set as well as several gene expression data sets derived from DNA microarray experiments. In addition to estimating orderings, the techniques we describe also provide useful heuristics for assessing relevant properties of sample datasets such as noise and sampling intensity, and we show how a data structure called a PQ-tree can be used to represent uncertainty in a reconstructed ordering. Availability: Academic implementations of the ordering algorithms are available as source code (in the programming language Python) on our web site, along with documentation on their use. The artificial 'jelly roll' data set upon which the algorithm was tested is also available from this web site. The publicly available gene expression data may be found at http://genome-www.stanford.edu/cellcycle/ and http://caulobacter.stanford.edu/CellCycle/ Contact: junhyong@sas.upenn.edu
机译:动机:由于同步,时间采样和速率异质性等问题,难以估计生物过程的准确时间序列。需要可以利用多维数据的方法,例如由DNA微阵列实验得到的数据,以便从无序或无序的观测集中重建时间序列。结果:我们提出了一组算法,用于从无序的样本元素集中估计时间顺序。我们描述的技术基于对从加权无向图计算出的最小生成树的修改。我们通过将这些技术应用于人工数据集以及源自DNA微阵列实验的几个基因表达数据集,证明了我们方法的有效性。除了估计顺序,我们描述的技术还提供了有用的启发式方法,用于评估样本数据集的相关属性,例如噪声和采样强度,并且我们展示了如何使用称为PQ树的数据结构来表示重构顺序中的不确定性。 。可用性:订购算法的学术实现可在我们的网站上以源代码(以Python编程语言)的形式获得,以及有关其使用的文档。也可以从该网站上获得用于测试算法的人工“果冻卷”数据集。公开的基因表达数据可在http://genome-www.stanford.edu/cellcycle/和http://caulobacter.stanford.edu/CellCycle/上找到:联系人:junhyong@sas.upenn.edu

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号