【24h】

Mining ordinal patterns for data cleaning

机译:挖掘顺序模式进行数据清理

获取原文

摘要

It is well recognized that sequential pattern mining plays an essential role in many scientific and business domains. In this paper, a new extension of sequential pattern, ordinal pattern, is proposed. An ordinal pattern is an ordinal sequence of attributes, whose values commonly occur in ascending order over data set. Ordinal pattern mining requests that values of different attributes must be comparable and ordinal. After each record in data set is transformed into an ordinal sequence of attributes according to their ordinal values, ordinal patterns can be mined by means of mining sequential patterns. But our work is different from sequential pattern mining. One use of ordinal patterns is to identify possible error records in data cleaning, in which the values of attributes break the ordinal patterns which most of the data conform to. Experiments verify the high efficiency of the method presented.
机译:众所周知,顺序模式挖掘在许多科学和商业领域中都扮演着至关重要的角色。在本文中,提出了顺序模式的新扩展,即顺序模式。顺序模式是属性的顺序序列,其值通常在数据集上以升序出现。顺序模式挖掘要求不同属性的值必须具有可比性和顺序性。将数据集中的每个记录根据其序号值转换为属性的序数序列后,可以通过挖掘顺序模式来挖掘序数模式。但是我们的工作不同于顺序模式挖掘。序数模式的一种用途是在数据清理中识别可能的错误记录,其中属性的值破坏了大多数数据所遵循的序数模式。实验证明了所提出方法的高效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号