首页> 美国卫生研究院文献>other >Outlier Detection Method in Linear Regression Based on Sum of Arithmetic Progression
【2h】

Outlier Detection Method in Linear Regression Based on Sum of Arithmetic Progression

机译:基于算术级数和的线性回归中的异常值检测方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We introduce a new nonparametric outlier detection method for linear series, which requires no missing or removed data imputation. For an arithmetic progression (a series without outliers) with n elements, the ratio (R) of the sum of the minimum and the maximum elements and the sum of all elements is always 2 : (0,1]. R ≠ 2 always implies the existence of outliers. Usually, R < 2 implies that the minimum is an outlier, and R > 2 implies that the maximum is an outlier. Based upon this, we derived a new method for identifying significant and nonsignificant outliers, separately. Two different techniques were used to manage missing data and removed outliers: (1) recalculate the terms after (or before) the removed or missing element while maintaining the initial angle in relation to a certain point or (2) transform data into a constant value, which is not affected by missing or removed elements. With a reference element, which was not an outlier, the method detected all outliers from data sets with 6 to 1000 elements containing 50% outliers which deviated by a factor of ±1.0e − 2 to ±1.0e + 2 from the correct value.
机译:我们为线性序列引入了一种新的非参数离群值检测方法,该方法不需要丢失或删除数据插补。对于具有n个元素的算术级数(无离群值的序列),最小和最大元素之和与所有元素之和的比率(R)始终为2 / n:(0,1]。R≠2 / n总是表示存在离群值,通常,R <2 / n表示最小值是离群值,R> 2 / n表示最大值是离群值,在此基础上,我们得出了一种识别重要值的新方法。分别使用两种不同的技术来管理丢失的数据和删除的异常值:(1)重新计算删除或丢失的元素之后(或之前)的项,同时保持相对于某个点的初始角度;或(2)将数据转换为恒定值,不受丢失或删除的元素的影响;对于不是异常值的参考元素,该方法从包含6到1000个元素的数据集中检测到所有异常值,其中包含50%的异常值正确v的±1.0e − 2至±1.0e + 2好的

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号