首页> 外文学位 >Human-machine collaboration for geographic knowledge discovery with high-dimensional clustering.
【24h】

Human-machine collaboration for geographic knowledge discovery with high-dimensional clustering.

机译:人机协作,用于通过高维聚类发现地理知识。

获取原文
获取原文并翻译 | 示例

摘要

Increasingly large (i.e., having a large number of observations) and high-dimensional (i.e., having many attributes) geographic data are being collected, but the spatial data analysis capabilities currently available have not kept up with the need for deriving meaningful information from these datasets. It is critical to develop new techniques to efficiently and effectively assist in analyzing current large and high-dimensional geographic datasets and addressing complex geographic problems, e.g., global change, socio-demographic factors for epidemiology, etc.; The goal of the reported research is to develop a geographic knowledge discovery environment and an integrated suite of efficient and effective data mining techniques for exploring novel, complex spatial patterns in large and high-dimensional geographic datasets. As the first step, the reported research focuses on interactive, hierarchical, multivariate spatial clustering.; The major contribution of the research is twofold. First, the research develops three novel approaches for spatial clustering, feature selection and multivariate clustering, and several visualization techniques to support visual exploration and human interactions. Second, it integrates both computational and visualization methods in a unified and flexible framework to create a human-led, computer-assisted, efficient and effective geographic knowledge discovery environment. Specifically, the developed knowledge discovery environment consists of four major groups of methods. (1) An efficient hierarchical spatial clustering method, which can identify arbitrary-shaped hierarchical 2D clusters at different scales, and generate a 1D ordering of the spatial points that preserves the entire hierarchical cluster structure; (2) An efficient and effective feature selection method, which can identify interesting subsets of attributes from the original data space; (3) An efficient hierarchical, multivariate clustering method, which can identify arbitrary-shaped hierarchical multivariate clusters given a set of attributes; (4) Various visualization techniques associated with each above method to support an interactive and iterative discovery process.; The developed methods are implemented within a component-oriented framework, which is: (1) flexible to customize and evolve over time, (2) collaborative in integrating various components to work together and address complex problems, and (3) robust to use and maintain. Three applications of the developed geographic knowledge discovery environment are presented to demonstrate how the developed methods and integrated discovery environment work and how well they work.
机译:正在收集越来越大的(即具有大量观测值)和高维(即具有许多属性)地理数据,但是当前可用的空间数据分析功能未能满足从这些数据中获取有意义的信息的需求。数据集。开发新技术以有效和有效地协助分析当前的大型和高维地理数据集并解决复杂的地理问题,例如全球变化,流行病学的社会人口学因素等,至关重要;所报告研究的目标是开发地理知识发现环境和一套高效有效的数据挖掘技术集成套件,以探索大型和高维地理数据集中的新颖,复杂的空间模式。作为第一步,已报道的研究重点是交互式的,分层的,多元的空间聚类。该研究的主要贡献是双重的。首先,这项研究开发了三种用于空间聚类,特征选择和多元聚类的新颖方法,以及支持可视化探索和人类交互的几种可视化技术。其次,它将计算和可视化方法集成在一个统一且灵活的框架中,以创建一个人为主导,计算机辅助,高效和有效的地理知识发现环境。具体而言,已开发的知识发现环境包括四大类方法。 (1)一种有效的分层空间聚类方法,可以识别不同比例的任意形状的分层2D聚类,并生成保留整个分层聚类结构的空间点的一维排序; (2)一种高效的特征选择方法,可以从原始数据空间中识别出有趣的属性子集; (3)一种有效的分层多元聚类方法,可以在给定一组属性的情况下识别任意形状的分层多元聚类; (4)与上述每种方法相关联的各种可视化技术,以支持交互和迭代的发现过程。所开发的方法是在面向组件的框架内实现的,该框架是:(1)灵活地随时间定制和发展;(2)协作集成各种组件以一起工作并解决复杂的问题;(3)健壮的使用和保持。介绍了已开发的地理知识发现环境的三个应用程序,以演示已开发的方法和集成的发现环境如何工作以及它们如何工作。

著录项

  • 作者

    Guo, Diansheng.;

  • 作者单位

    The Pennsylvania State University.;

  • 授予单位 The Pennsylvania State University.;
  • 学科 Geography.; Information Science.
  • 学位 Ph.D.
  • 年度 2003
  • 页码 228 p.
  • 总页数 228
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自然地理学;信息与知识传播;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号