Finding outliers in models of spatial data


Abstract

Statistical models fit to data often require extensive and challenging re-estimation before achieving final form. For example, outliers can adversely affect fits. In other cases involving spatial data, a cluster may exist for which the model is incorrect, also adversely affecting the fit to the "good" data. In both cases, estimated residuals must be checked and rechecked until the data are cleaned and the appropriate model found. In this article, we demonstrate an algorithm that fits models to the largest subset of the data that is appropriate. Specifically, if a hypothesized linear regression model fits ninety percent of the data, our algorithm can not only find an excellent fit as if only the "good" data were presented, but will also highlight the ten percent of "bad" data that is not fit. Our work in digital government has focused on mapping data. Thus we illustrate how models fit to census tract data work, and how the data in the "bad" set can be viewed spatially through ArcView or other tools. This approach greatly simplifies the task of modeling spatial data, and makes use of advanced map visualization tools to understand the nature of subsets of the data for which the model is not appropriate.
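
The abstract does not say which estimator the authors use, so the following is only an illustrative sketch of the general idea (fit a linear regression to the best-fitting ninety percent of the data and flag the rest), not the paper's algorithm. It substitutes a simple trimmed least squares iteration: fit, keep the points with the smallest absolute residuals, and refit until the kept set stops changing. The function name `trimmed_fit`, the parameter `keep_frac`, and the synthetic data are all assumptions made for illustration.

```python
import numpy as np


def trimmed_fit(x, y, keep_frac=0.9, max_iter=50):
    """Hypothetical helper: ordinary least squares refit on the keep_frac
    share of points with the smallest absolute residuals.

    Returns (beta, bad) where beta = [intercept, slope] and bad is a
    boolean mask marking the points left out of the final fit.
    """
    n = len(y)
    h = int(np.ceil(keep_frac * n))          # number of points to keep
    A = np.column_stack([np.ones(n), x])     # design matrix with intercept
    keep = np.arange(n)                      # start from all points
    for _ in range(max_iter):
        beta, *_ = np.linalg.lstsq(A[keep], y[keep], rcond=None)
        resid = np.abs(y - A @ beta)         # residuals for every point
        new_keep = np.sort(np.argsort(resid)[:h])
        if np.array_equal(new_keep, keep):   # kept set is stable: stop
            break
        keep = new_keep
    bad = np.ones(n, dtype=bool)
    bad[keep] = False
    return beta, bad


# Synthetic data (an assumption, not census data): 200 points on the
# line y = 2 + 3x with small noise; the first 20 are shifted upward to
# play the role of the "bad" ten percent.
rng = np.random.default_rng(0)
x = rng.uniform(0.0, 10.0, size=200)
y = 2.0 + 3.0 * x + rng.normal(0.0, 0.5, size=200)
y[:20] += 30.0

beta, bad = trimmed_fit(x, y, keep_frac=0.9)
print("intercept, slope:", beta)
print("flagged points:", np.flatnonzero(bad))
```

In a mapping workflow, the `bad` mask would be joined back to the tract records so that the flagged tracts can be displayed in a GIS viewer. A simple iteration like this can fail when the "bad" points have high leverage, which is one reason a purpose-built robust estimator may be preferable in practice.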

Bibliographic details

Source
Proceedings of the 2003 annual national conference on Digital government research | 2003 | pp. 1-6 | 6 pages
Conference location: Boston, MA (US)
Authors
David W. Scott; J. Blair Christian
Author affiliation

Rice University, Houston, TX

Original format: PDF
Language: English
Chinese Library Classification: Computing technology, computer technology

