METHOD FOR IDENTIFYING OUTLIERS IN LARGE DATA SETS

Abstract

A method for identifying a predetermined number of data points of interest (outliers) in a large data set. The data points of interest are ranked by the distance to their neighboring points. The method employs partition-based detection algorithms that first partition the data points and then compute upper and lower bounds for each partition. These bounds are used to eliminate the partitions that cannot contain any of the predetermined number of data points of interest. The data points of interest are then computed from the remaining partitions. Because a significant number of data points are eliminated from consideration before any exact computation, the method is substantially cheaper computationally than conventional methods for identifying such points.
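The abstract leaves open how the partitions and the per-partition bounds are obtained. The following is a minimal Python sketch of the general idea, not the patented procedure itself. It makes several assumptions: a point's score is the distance to its k-th nearest neighbor, partitions are cells of a uniform grid (the `cell` parameter), and bounds are derived from each partition's bounding box. All function and parameter names are hypothetical.

```python
import heapq
import math
from collections import defaultdict

def kth_nn_dist(p, points, k):
    """Exact distance from p (a member of points) to its k-th nearest neighbor."""
    # Index 0 of the k+1 smallest distances is p itself (distance 0).
    return heapq.nsmallest(k + 1, (math.dist(p, q) for q in points))[k]

def bounding_box(pts):
    return (tuple(min(c) for c in zip(*pts)), tuple(max(c) for c in zip(*pts)))

def min_dist(a, b):
    """Smallest possible distance between a point in box a and a point in box b."""
    (alo, ahi), (blo, bhi) = a, b
    return math.sqrt(sum(max(0.0, bl - ah, al - bh) ** 2
                         for al, ah, bl, bh in zip(alo, ahi, blo, bhi)))

def max_dist(a, b):
    """Largest possible distance between a point in box a and a point in box b."""
    (alo, ahi), (blo, bhi) = a, b
    return math.sqrt(sum(max(ah - bl, bh - al) ** 2
                         for al, ah, bl, bh in zip(alo, ahi, blo, bhi)))

def kth_bound(own, parts, k, dist_fn):
    """k-th smallest box-to-box bound, each bound counted once per possible neighbor."""
    entries = sorted((dist_fn(own["box"], q["box"]), len(q["pts"]) - (q is own))
                     for q in parts)
    seen = 0
    for d, count in entries:
        seen += count
        if seen >= k:
            return d
    raise ValueError("the data set must contain more than k points")

def top_outliers(points, k, n, cell=1.0):
    """Return (top-n list of (score, point), number of points pruned without scoring)."""
    # Step 1: partition the points (assumption: uniform grid cells).
    cells = defaultdict(list)
    for p in points:
        cells[tuple(math.floor(c / cell) for c in p)].append(p)
    parts = [{"pts": pts, "box": bounding_box(pts)} for pts in cells.values()]

    # Step 2: lower and upper bounds on the score of any point in each partition.
    for part in parts:
        part["lower"] = kth_bound(part, parts, k, min_dist)
        part["upper"] = kth_bound(part, parts, k, max_dist)

    # Step 3: threshold such that at least n points are guaranteed to score >= it.
    seen, threshold = 0, 0.0
    for part in sorted(parts, key=lambda q: q["lower"], reverse=True):
        seen += len(part["pts"])
        threshold = part["lower"]
        if seen >= n:
            break

    # Step 4: drop partitions whose upper bound is below the threshold,
    # then score the surviving points exactly (brute force in this sketch).
    candidates = [p for part in parts if part["upper"] >= threshold
                  for p in part["pts"]]
    scored = [(kth_nn_dist(p, points, k), p) for p in candidates]
    return heapq.nlargest(n, scored), len(points) - len(candidates)
```

Calling `top_outliers(points, k=3, n=3)` on a dense cluster plus a few distant points scores only the distant points exactly; the whole cluster is discarded because its upper bound falls below the threshold. The savings depend heavily on how well the partitions match the data, which is why the choice of partitioning is flagged here as an assumption.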

Bibliographic details

Publication number: US2003061249A1

Patent type:
Publication date: 2003-03-27

Original format: PDF
Applicant/Assignee: RAMASWAMY SRIDHAR; RASTOGI RAJEEV; SHIM KYUSEOK

Application number: US19990442912
Inventors: RAJEEV RASTOGI; KYUSEOK SHIM; SRIDHAR RAMASWAMY

Filing date: 1999-11-18
Classification: G06F3/00
Country: US
Date added to database: 2022-08-22 00:09:43
