首页> 外文会议>Industrial conference on data mining >Data Quality Visualization for Preprocessing
【24h】

Data Quality Visualization for Preprocessing

机译:预处理的数据质量可视化

获取原文

摘要

Preprocessing is often the most time-consuming phase in data analysis and interdependent data quality issues a cause of suboptimal modelling results. The design problem addressed in this paper is: what kind of framework can support visualization of data quality issue interdependencies for faster and more effective preprocessing? An object framework was designed that uses constructed features as a basis of visualizations. Six real datasets from business performance measurement system domain were acquired to demonstrate the implementation. The framework was found to be a viable preprocessing analysis supplement to both industry practice of exploratory data analysis and research benchmark of preprocessing combinations.
机译:预处理通常是数据分析中最耗时的阶段,相互依赖的数据质量会导致建模结果欠佳。本文解决的设计问题是:哪种框架可以支持可视化数据质量问题的相互依赖关系,以实现更快,更有效的预处理?设计了一个对象框架,该对象框架使用构造的功能作为可视化的基础。从业务绩效评估系统域中获取了六个真实的数据集,以演示该实现。发现该框架是对探索性数据分析的行业实践和预处理组合的研究基准的可行的预处理分析补充。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号