Journal: Information Visualization

Visualization model validation via inline replication

Abstract

Data visualizations typically show a representation of a data set with little to no focus on the repeatability or generalizability of the displayed trends and patterns. However, insights gleaned from these visualizations are often used as the basis for decisions about future events. Visualizations of retrospective data therefore often serve as "visual predictive models." However, this visual predictive model approach can lead to invalid inferences. In this article, we describe an approach to visual model validation called Inline Replication. Inline Replication is closely related to the statistical techniques of bootstrap sampling and cross-validation and, like those methods, provides a non-parametric and broadly applicable technique for assessing the variance of findings from visualizations. This article describes the overall Inline Replication process and outlines how it can be integrated into both traditional and emerging "big data" visualization pipelines. It also provides examples of how Inline Replication can be integrated into common visualization techniques such as bar charts and linear regression lines. Results from an empirical evaluation of the technique and two prototype Inline Replication-based visual analysis systems are also described. The empirical evaluation demonstrates the impact of Inline Replication under different conditions, showing that both (1) the level of partitioning and (2) the approach to aggregation have a major influence over its behavior. The results highlight the trade-offs in choosing Inline Replication parameters but suggest that using n=5 partitions is a reasonable default.
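
The abstract names the two parameters that drive the technique (the level of partitioning and the approach to aggregation) but does not spell out the procedure. The following is a minimal sketch of one way the idea could look for a bar chart of per-category means. It is not the authors' implementation. The function name `inline_replication`, the choice of disjoint random partitions, and the min/max range used as the aggregate are all assumptions made for illustration; only the default of 5 partitions comes from the abstract.

```python
import random
import statistics
from collections import defaultdict


def inline_replication(records, n_partitions=5, seed=0):
    """Recompute a bar-chart statistic on random partitions of the data.

    `records` is an iterable of (category, value) pairs. Returns, per
    category, the mean of the per-partition means plus their range.
    Hypothetical helper: partitioning and aggregation choices are assumed.
    """
    rng = random.Random(seed)
    shuffled = list(records)
    rng.shuffle(shuffled)

    # Assumed partitioning: disjoint, near-equal random subsets
    # (round-robin over the shuffled records).
    partitions = [shuffled[i::n_partitions] for i in range(n_partitions)]

    # Replicate the chart's statistic (mean per category) in each partition.
    replicates = defaultdict(list)
    for part in partitions:
        by_category = defaultdict(list)
        for category, value in part:
            by_category[category].append(value)
        for category, values in by_category.items():
            replicates[category].append(statistics.mean(values))

    # Assumed aggregation: centre plus the min/max range across replicates,
    # which a bar chart could draw as a bar with a variability band.
    return {
        category: {
            "mean": statistics.mean(means),
            "min": min(means),
            "max": max(means),
            "replicates": len(means),
        }
        for category, means in replicates.items()
    }


if __name__ == "__main__":
    # Synthetic data: two categories whose true means differ only slightly.
    rng = random.Random(42)
    data = [("A", rng.gauss(10.0, 2.0)) for _ in range(200)]
    data += [("B", rng.gauss(10.5, 2.0)) for _ in range(200)]
    for category, summary in sorted(inline_replication(data).items()):
        print(category, summary)
```

If the min/max ranges of two bars overlap heavily across partitions, the apparent difference between them may not replicate, which is the kind of invalid inference the abstract warns about. Raising `n_partitions` yields more replicates but fewer records in each, one plausible reading of the trade-off the abstract mentions.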
