Computational Statistics & Data Analysis

Quantile map: Simultaneous visualization of patterns in many distributions with application to tandem mass spectrometry

Abstract

High-throughput experiments have become increasingly prevalent in biomedical research, and the resulting high-dimensional data bring new challenges. Effective data reduction, summarization and visualization are key to initial exploration in data mining. In this paper, we introduce a visualization tool, the quantile map, that presents the information contained in a probability distribution. We demonstrate its use as a visual analysis tool through an application to a tandem mass spectrometry data set. The quantiles of a distribution are displayed in gradient colors as concentric doughnuts. The width of the doughnuts is proportional to the Fisher information of the distribution, so that the visual effect is unbiased. A parametric empirical Bayes (PEB) approach is shown to improve on the simple maximum likelihood estimate (MLE) approach when estimating the Fisher information. In the motivating example from tandem mass spectrometry data, multiple probability distributions are displayed on a two-dimensional grid. Hierarchical clustering to reorder rows and columns and a gradient color selection from a Hue-Chroma-Luminance model, similar to those commonly applied in heatmaps for microarray analysis, are adopted to improve the visualization. Both simulations and the motivating example show superior performance of the quantile map in summarizing and visualizing such high-throughput data sets.
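
The ring construction described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the function names are hypothetical, the Fisher information is taken to be the plug-in value n / s^2 for the mean of a normal model (the abstract does not say which model is used), and the PEB estimate, the clustering step and the Hue-Chroma-Luminance color selection are all left out.

```python
import statistics


def fisher_information_normal_mean(sample):
    """Plug-in Fisher information about the mean under a normal model: n / s^2.

    Assumption for illustration only; the abstract does not specify the model.
    """
    return len(sample) / statistics.variance(sample)


def quantile_rings(sample, width, n_rings=5, hole=0.2):
    """Return (inner_radius, outer_radius, quantile) tuples, innermost ring first."""
    # n_rings cut points at levels 1/(n_rings+1), ..., n_rings/(n_rings+1)
    cuts = statistics.quantiles(sample, n=n_rings + 1)
    step = width / n_rings
    return [(hole + k * step, hole + (k + 1) * step, q) for k, q in enumerate(cuts)]


def quantile_map_cells(samples, max_width=0.8, n_rings=5):
    """Build ring geometry for several distributions, one grid cell each."""
    infos = [fisher_information_normal_mean(s) for s in samples]
    top = max(infos)
    cells = []
    for s, info in zip(samples, infos):
        # Doughnut width is proportional to the estimated information,
        # scaled so the most informative cell gets max_width.
        width = max_width * info / top
        cells.append(quantile_rings(s, width, n_rings))
    return cells
```

A plotting layer would then draw each tuple as an annulus and map its quantile value onto a color gradient, so that cells with more information appear as wider doughnuts.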