Conference: The Fourth International Conference on Developments in eSystems Engineering

An Enhanced Technique to Clean Data in the Data Warehouse

Abstract

Data quality is a critical factor in the success of data warehousing projects. Improving data quality matters in a data warehouse because the data feed decision support, which requires accurate data. Many errors and inconsistencies appear in data sets when they are brought in from several sources. Data cleaning is the process of identifying and then removing or correcting errors in the data. Several data cleaning methods exist, but they are generally inefficient because of the variety of errors involved. In this paper we present an enhanced technique for cleaning data in the data warehouse. It uses a new algorithm that detects and corrects most error types and expected problems, such as lexical errors, domain format errors, irregularities, integrity constraint violations, and duplicates.
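The abstract names five error classes but does not describe the algorithm itself. As a rough illustration of what each class can look like in practice, the following Python sketch flags one example of each. It is not the paper's algorithm: the schema (`FIELDS`), the alias table (`COUNTRY_ALIASES`), the function name `clean`, and the rule chosen for each error class are all assumptions made for this example.

```python
from datetime import datetime

# Hypothetical schema and reference data, chosen only for illustration.
FIELDS = ("customer_id", "name", "signup_date", "country")
COUNTRY_ALIASES = {
    "us": "US", "usa": "US", "united states": "US",
    "uk": "GB", "gb": "GB",
}


def clean(rows):
    """Return (cleaned, issues); issues is a list of (row_index, error_kind)."""
    cleaned, issues = [], []
    seen_ids, seen_content = set(), set()
    for i, row in enumerate(rows):
        # Lexical error: the row does not match the expected field count.
        if len(row) != len(FIELDS):
            issues.append((i, "lexical"))
            continue
        record = {key: value.strip() for key, value in zip(FIELDS, row)}

        # Domain format error: the date is not in YYYY-MM-DD form.
        try:
            datetime.strptime(record["signup_date"], "%Y-%m-%d")
        except ValueError:
            issues.append((i, "domain_format"))
            continue

        # Irregularity: non-uniform country spellings are mapped to one code;
        # values with no known mapping are flagged instead of guessed.
        country = COUNTRY_ALIASES.get(record["country"].lower())
        if country is None:
            issues.append((i, "irregularity"))
            continue
        record["country"] = country

        # Duplicate: the same entity already appeared, possibly under another id.
        content = (record["name"].lower(), record["signup_date"], country)
        if content in seen_content:
            issues.append((i, "duplicate"))
            continue

        # Integrity constraint violation: the key is missing or reused.
        if not record["customer_id"] or record["customer_id"] in seen_ids:
            issues.append((i, "integrity"))
            continue

        seen_ids.add(record["customer_id"])
        seen_content.add(content)
        cleaned.append(record)
    return cleaned, issues


SAMPLE_ROWS = [
    ["1", "Ann Lee", "2021-03-04", "USA"],
    ["2", "Bob Ray", "04/03/2021", "UK"],
    ["3", "Cy Doe", "2021-05-06"],
    ["1", "Dee Fox", "2021-07-08", "uk"],
    ["4", "ann lee ", "2021-03-04", "United States"],
    ["5", "Eve Orr", "2021-09-10", "Atlantis"],
]

if __name__ == "__main__":
    cleaned_rows, found = clean(SAMPLE_ROWS)
    print(cleaned_rows)
    print(found)
```

This sketch only corrects the irregular country spellings and rejects every other problem row. A technique that corrects most error types, as the abstract claims, would need a repair rule for each class.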