...
【24h】

Detection and Deletion of Outliers from Large Datasets

机译:从大数据集中检测和删除异常值

获取原文

摘要

The paper proposes a method for detecting and deleting distance based outliers in very large data sets. This is based on the outlier detection solving set algorithm. This method introduces parallel computation so as to save more time and having excellent performance. First, weights are assigned to each of the data in the data sets. Based on the weights outliers from all the data sets are obtained by using the distance based method and finally they are all deleted. By deleting the outliers, it increases the space for storing more data.
机译:本文提出了一种在非常大的数据集中检测和删除距离异常值的方法。这基于离群值检测求解集算法。该方法引入了并行计算,以节省更多时间并具有出色的性能。首先,将权重分配给数据集中的每个数据。基于所有数据集的权重异常值,使用基于距离的方法获得,最后将它们全部删除。通过删除异常值,它增加了存储更多数据的空间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号