Visualizing large data volumes utilizing initial sampling and multi-stage calculations

Abstract

Embodiments visualize large data volumes utilizing initial sampling to reduce size of a dataset. This sampling may be random in nature. The sampled dataset may be refined (wrangled) by binning, grouping, cleansing, and/or other techniques to produce a wrangled sample dataset. A user defines useful end visualization(s) by inputting expected dimension/measures. From these visualizations of sampled data, minimal grouping sets are deduced for application to the full dataset. The user publishes/schedules the wrangled operation and grouping sets definition. Based on this, a wrangled dataset and grouping sets are produced in the big data layer. When the user accesses the visualization(s), minimal grouping sets are retrieved in the in-memory engine of the client and processed by an in-memory database engine according to the common processing plan. This produces result sets and a final set of visualizations of the full dataset, in which the user can recognize valuable data trends and/or relationships.
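
The stages named in the abstract (sample, wrangle, define visualizations, deduce minimal grouping sets, aggregate the full dataset, process on the client) can be sketched as follows. This is a minimal illustration and not the patented implementation: every function name, the column names (`region`, `age`, `sales`), and the rule used to pick "minimal" grouping sets are assumptions. In particular, the sketch assumes an additive measure (SUM), so a coarser grouping can be rolled up from a finer one and any dimension set contained in another can be dropped.

```python
import random
from collections import defaultdict

def sample_rows(rows, fraction, seed=0):
    """Initial random sampling to shrink the dataset (hypothetical helper)."""
    rng = random.Random(seed)
    return [r for r in rows if rng.random() < fraction]

def wrangle(rows):
    """Cleanse rows with a missing measure and bin 'age' into decades."""
    out = []
    for r in rows:
        if r["sales"] is None:
            continue
        out.append({**r, "age_bin": f"{r['age'] // 10 * 10}s"})
    return out

def minimal_grouping_sets(visualizations):
    """Assumed rule: keep only dimension sets not strictly contained in another."""
    dim_sets = {frozenset(v["dimensions"]) for v in visualizations}
    return [s for s in dim_sets if not any(s < t for t in dim_sets)]

def aggregate(rows, dims, measure):
    """SUM(measure) grouped by dims; stands in for the big data layer."""
    dims = sorted(dims)
    totals = defaultdict(float)
    for r in rows:
        totals[tuple(r[d] for d in dims)] += r[measure]
    return dims, dict(totals)

def roll_up(grouping_set, wanted_dims):
    """Client side: derive a coarser aggregate from a stored grouping set."""
    dims, totals = grouping_set
    idx = [dims.index(d) for d in sorted(wanted_dims)]
    out = defaultdict(float)
    for key, value in totals.items():
        out[tuple(key[i] for i in idx)] += value
    return dict(out)

# Toy "full dataset"; in practice this would live in the big data layer.
full = [
    {"region": "EU", "age": 34, "sales": 10.0},
    {"region": "EU", "age": 47, "sales": 5.0},
    {"region": "US", "age": 31, "sales": 7.0},
    {"region": "US", "age": 38, "sales": None},
]
# Visualizations the user would design against the wrangled sample.
visualizations = [
    {"dimensions": ["region"], "measure": "sales"},
    {"dimensions": ["region", "age_bin"], "measure": "sales"},
]
sample = wrangle(sample_rows(full, fraction=0.5))
sets_needed = minimal_grouping_sets(visualizations)
stored = aggregate(wrangle(full), sets_needed[0], "sales")
by_region = roll_up(stored, ["region"])
```

Here both visualizations are served from a single stored grouping set over `region` and `age_bin`; the per-region chart is obtained by rolling that set up on the client. For measures that are not additive (for example a distinct count), the subset rule would not hold and each dimension set would need its own aggregate.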

Bibliographic data

  • Publication number: US10459932B2

  • Publication date: 2019-10-29

  • Original format: PDF

  • Applicant/Patentee: ALEXIS NAIBO; XIAOHUI XU; YANN LE BIANNIC

  • Application number: US201414575633

  • Inventors: ALEXIS NAIBO; XIAOHUI XU; YANN LE BIANNIC

  • Filing date: 2014-12-18

  • Classification: G06F16; G06F16/2458; G06F16/26; G06F16/9038; G06F16/21; G06F16/23; G06F16/338; G06F16/34

  • Country: US

  • Date indexed: 2022-08-21 12:15:07
