
Universal Linear Fit Identification: A Method Independent of Data Outliers and Noise Distribution Model and Free of Missing or Removed Data Imputation

Abstract

Data processing requires a robust linear fit identification method. In this paper, we introduce a non-parametric robust linear fit identification method for time series. The method uses the indicator 2/n to identify linear fit, where n is the number of terms in a series. The ratio R_max of a_max − a_min to S_n − a_min * n and the ratio R_min of a_max − a_min to a_max * n − S_n are always equal to 2/n, where a_max is the maximum element, a_min is the minimum element and S_n is the sum of all elements. If a series expected to follow y = c contains data that do not agree with the y = c form, R_max > 2/n and R_min > 2/n imply that the maximum and minimum elements, respectively, do not agree with linear fit. We define the threshold values for outlier and noise detection as 2/n * (1 + k_1) and 2/n * (1 + k_2), respectively, where k_1 > k_2 and 0 ≤ k_1 ≤ n/2 − 1. Given this relation and a transformation technique that transforms data into the form y = c, we show that removing all data that do not agree with linear fit is possible. Furthermore, the method is independent of the number of data points, missing data, removed data points and the nature of the distribution (Gaussian or non-Gaussian) of outliers, noise and clean data. These are major advantages over existing linear fit methods. Since a perfect linear relation between two variables is impossible in the real world, we used artificial data sets with extreme conditions to verify the method. The method detects the correct linear fit even when the percentage of data agreeing with linear fit is less than 50% and the deviation of data that do not agree with linear fit is very small, of the order of ±10⁻⁴%. The method results in incorrect detections only when numerical accuracy is insufficient in the calculation process.
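The two ratios and the threshold test described in the abstract can be sketched in a few lines of Python. This is only an illustration built from the definitions given in the abstract, not the authors' implementation: the function names `ratios` and `flag_extremes` and the single parameter `k` (standing in for either k_1 or k_2) are assumptions, and the transformation of data into the form y = c is not shown because the abstract does not describe it. The first example relies on the fact that, for an evenly spaced series, both ratios reduce algebraically to 2/n.

```python
from typing import Sequence, Tuple


def ratios(series: Sequence[float]) -> Tuple[float, float]:
    """Return (R_max, R_min) as defined in the abstract.

    R_max = (a_max - a_min) / (S_n - a_min * n)
    R_min = (a_max - a_min) / (a_max * n - S_n)
    """
    n = len(series)
    a_max, a_min, s_n = max(series), min(series), sum(series)
    spread = a_max - a_min
    if spread == 0:
        # Both denominators are zero for a constant series (0/0).
        raise ValueError("ratios are undefined for a constant series")
    r_max = spread / (s_n - a_min * n)
    r_min = spread / (a_max * n - s_n)
    return r_max, r_min


def flag_extremes(series: Sequence[float], k: float = 0.0) -> Tuple[bool, bool]:
    """Compare both ratios with the threshold 2/n * (1 + k).

    Returns (max_flagged, min_flagged). `k` is a hypothetical stand-in
    for the abstract's k_1 (outliers) or k_2 (noise).
    """
    n = len(series)
    threshold = 2.0 / n * (1.0 + k)
    r_max, r_min = ratios(series)
    return r_max > threshold, r_min > threshold


if __name__ == "__main__":
    # Evenly spaced series: both ratios equal 2/n = 0.4.
    print(ratios([1, 2, 3, 4, 5]))
    # One large element: R_max = 1.0 exceeds 2/n, R_min = 0.25 does not.
    print(ratios([1, 1, 1, 1, 10]))
    print(flag_extremes([1, 1, 1, 1, 10]))  # (True, False)
```

In the second example only the maximum element is flagged, which matches the abstract's reading that R_max > 2/n points at the maximum element and R_min > 2/n at the minimum. Because the decision depends on comparing floating-point ratios with 2/n, very small deviations may require higher-precision arithmetic, in line with the abstract's remark on numerical accuracy.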

Bibliographic details

Journal name: other

Authors: K. K. L. B. Adikaram; M. A. Hussein; M. Effenberger; T. Becker

Year (volume), issue: 2015 (10), 11

Pages: e0141486

Total pages: 18

Original format: PDF

Date indexed: 2022-08-21 11:14:02
