首页> 外文会议>IEEE International Symposium on Signal Processing and Information Technology >A Semi-Automatic System for Data Management and Cleaning
【24h】

A Semi-Automatic System for Data Management and Cleaning

机译:用于数据管理和清洁的半自动系统

获取原文

摘要

We present a data management system intended for semi-automatic database pre-processing. This system allows the user to obtain useable data from a large amount of records, simplifying the cleaning process with a set of built-in tools which allows the user to identify and exploit patterns in the data. This paper describes a system with set of tools including categorical definition of variables, multi-variable column split using stop characters, data cleansing based on regular expression and inclusion of tables and standards from datasets; is presented. The framework allows for operation traceability, making it feasible to create a repeatable and fast data pre-processing and cleansing procedure.
机译:我们提出了一种用于半自动数据库预处理的数据管理系统。该系统允许用户从大量记录中获得可用数据,简化了一组内置工具的清洁过程,允许用户识别和利用数据中的模式。本文介绍了一组工具组的系统,包括变量的分类定义,使用停止字符的多变量列分割,基于正则表达和包含来自数据集的表和标准的数据清理;被表达。该框架允许操作可追溯性,从而可以创建可重复和快速数据预处理和清洁程序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号