首页> 外国专利> METHOD FOR IDENTIFYING OUTLIERS IN LARGE DATA SETS

METHOD FOR IDENTIFYING OUTLIERS IN LARGE DATA SETS

机译:大型数据集中的外围对象识别方法

摘要

A new method for identifying a predetermined number of data points of interest in a large data set. The data points of interest are ranked in relation to the distance to their neighboring points. The method employs partition-based detection algorithms to partition the data points and then compute upper and lower bounds for each partition. These bounds are then used to eliminate those partitions that do contain the predetermined number of data points of interest. The data points of interest are then computed from the remaining partitions that were not eliminated. The present method eliminates a significant number of data points from consideration as the points of interest, thereby resulting in substantial savings in computational expense compared to conventional methods employed to identify such points.
机译:一种用于识别大型数据集中预定数量的感兴趣数据点的新方法。感兴趣的数据点是根据到其相邻点的距离进行排序的。该方法采用基于分区的检测算法对数据点进行分区,然后为每个分区计算上限和下限。然后使用这些界限消除那些确实包含预定数量的感兴趣数据点的分区。然后从尚未消除的其余分区中计算出感兴趣的数据点。本方法从考虑中消除了大量数据点作为关注点,从而与用于识别此类点的常规方法相比,节省了计算费用。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号