首页> 外文学位 >Approach for mining multiple dependence structure with pattern recognition applications.
【24h】

Approach for mining multiple dependence structure with pattern recognition applications.

机译:利用模式识别应用程序挖掘多依赖结构的方法。

获取原文
获取原文并翻译 | 示例

摘要

Multiple dependence structure mining is a basic task for many fields, such as the data mining and pattern recognition. In this thesis we investigate, theoretically or empirically, several typical approaches for the data dependence structure mining, with the applications on both feature extraction and shape detection in image, two basic topics in pattern recognition.; We start by the local PCA model, which considers the dependence structure up to second order statistics. Typically, we apply the local PCA model on two interesting application cases. The first one is the astronomy object detection and classification, via the Bayesian Ying-Yang (BYY) normalization learning based local PCA algorithm, as both the feature extraction and clustering tools. The second application is the strip line detection and thinning in image by the rival penalized competitive learning (RPCL) based local PCA algorithm, via constraining each local subspace with a line-shape structure.; To describe the dependence structure concerning higher statistics and even with noise extension, we then proceed to discuss the ICA mixture and NFA mixture model for the feature extraction. The ICA mixture, which aims at the multiple independence structure mining, is superior to local PCA in the sense that the ICA can take advantage of the higher-order statistics of the samples, and meantime, the NFA mixture further improves the ICA mixture in that it relaxes the impractical noise-free assumption for the ICA. We not only applied the two models as the feature extraction tool in the star/galaxy classification system, but more importantly, (1) for the ICA mixture, under certain weak assumptions we proved a fundamentally important issue for the ICA model, i.e., the so-called one-bit-matching conjecture which states that “all the sources can be separated as long as there is a one-to-one same-sign-correspondence between the kurtosis signs of all source probability density functions (pdf's) and the kurtosis signs of all model pdf's”. (2) For the NFA mixture model, we focus on a key yet analytically intractable step—the factor estimating step for the algorithm.; Furthermore, for the non-liner shape dependence structure, we proceed to discuss the multisets mixture learning (MML) based shape detection in image. The MML provides a general method for shape detection in image by minimizing the mean square error (MSE) reconstruction error and the shapes to be detected for the MML can be roughly classified into two categories. The first category includes the shapes that can be mathematically formulized and the second category includes those represented by a pre-set template. For the first category which by nature needs specific algorithms for different shapes, we develop a MML based algorithm for detecting ellipse in image. For the second category whose template is represented by a set of contour points, we develop an efficient line segment approximation (LSA) algorithm to calculate the sample reconstruction error, which needs not enumerate the distances between the sample and all the contour points. (Abstract shortened by UMI.)
机译:多依赖结构挖掘是许多领域的基本任务,例如数据挖掘和模式识别。在本文中,我们从理论上或经验上研究了几种典型的数据依赖结构挖掘方法,并将其应用于图像特征提取和形状检测这两个模式识别的基本主题。我们从本地PCA模型开始,该模型考虑依赖结构直至二阶统计量。通常,我们将本地PCA模型应用于两个有趣的应用案例。第一个是天文学目标的检测和分类,它是通过基于贝叶斯英阳(BYY)归一化学习的局部PCA算法进行的,同时作为特征提取和聚类工具。第二个应用是通过用线形结构约束每个局部子空间,通过基于竞争性惩罚性竞争学习(RPCL)的局部PCA算法对带状线进行检测和细化。为了描述与更高统计量甚至噪声扩展相关的依存结构,我们接着讨论用于特征提取的ICA混合和NFA混合模型。旨在多重独立结构挖掘的ICA混合物优于ICA,因为ICA可以利用样本的高阶统计量,同时NFA混合物在以下方面进一步改善了ICA混合物:它放宽了ICA不切实际的无噪声假设。我们不仅将这两个模型用作星/银河分类系统中的特征提取工具,而且更重要的是,(1)对于ICA混合物,在某些弱假设下,我们证明了ICA模型的根本重要问题,即所谓的一位匹配猜想,即“只要所有源概率密度函数(pdf's)的峰度符号之间存在一对一的相同符号对应关系,就可以分离所有源。所有模型pdf的峰度标志”。 (2)对于NFA混合模型,我们专注于关键但分析上难以处理的步骤-该算法的因子估计步骤。此外,对于非线性形状依赖结构,我们继续讨论基于多集混合学习(MML)的图像形状检测。 MML通过最小化均方误差(MSE)重建误差,为图像中的形状检测提供了一种通用方法,针对MML的待检测形状可以大致分为两类。第一类别包括可以数学公式化的形状,第二类别包括由预设模板表示的形状。对于本质上需要针对不同形状的特定算法的第一类,我们开发了一种基于MML的算法来检测图像中的椭圆。对于第二类模板由一组轮廓点表示的模型,我们开发了一种有效的线段近似(LSA)算法来计算样本重构误差,该算法无需枚举样本与所有轮廓点之间的距离。 (摘要由UMI缩短。)

著录项

  • 作者

    Liu, Zhiyong.;

  • 作者单位

    Chinese University of Hong Kong (People's Republic of China).;

  • 授予单位 Chinese University of Hong Kong (People's Republic of China).;
  • 学科 Computer Science.; Engineering Electronics and Electrical.; Statistics.
  • 学位 Ph.D.
  • 年度 2003
  • 页码 136 p.
  • 总页数 136
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;无线电电子学、电信技术;统计学;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号