首页>
外国专利>
METHOD FOR IDENTIFYING OUTLIERS IN LARGE DATA SETS
METHOD FOR IDENTIFYING OUTLIERS IN LARGE DATA SETS
展开▼
机译:大型数据集中的外围对象识别方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
A new method for identifying a predetermined number of data points of interest in a large data set. The data points of interest are ranked in relation to the distance to their neighboring points. The method employs partition-based detection algorithms to partition the data points and then compute upper and lower bounds for each partition. These bounds are then used to eliminate those partitions that do contain the predetermined number of data points of interest. The data points of interest are then computed from the remaining partitions that were not eliminated. The present method eliminates a significant number of data points from consideration as the points of interest, thereby resulting in substantial savings in computational expense compared to conventional methods employed to identify such points.
展开▼