首页> 外文学位 >Image annotation and feature engineering via structural sparsity and low-rank approximation.
【24h】

Image annotation and feature engineering via structural sparsity and low-rank approximation.

机译:通过结构稀疏性和低秩逼近进行图像注释和特征工程。

获取原文
获取原文并翻译 | 示例

摘要

Nowadays, in order to sense environment and understand human behaviors, data analysis plays a more and more important role to handle heterogeneous data ranging from different domains, e.g., image categorization/annotation, customer segmentation, traffic prediction, ad optimization, recommendation systems, privacy analysis, etc. The large amount of multivariate data raises the fundamental problem of data mining: how to discover meaningful compact patterns hidden in the high-dimensional noisy observations? One approach is to do dimension reduction, which finds the low-dimensional subspace and thus encodes data in a low-dimensional structure. The other approach is to do feature selection or feature engineering, which manipulates the features to capture the most discriminant patterns for classification/clustering tasks.;In this thesis, to further improve the low-dimensional embedding results, an iteratively locally linear embedding algorithm is proposed, which captures the global structure of non-linear manifold through iteratively updating the embedding. To handle noisy data (e.g., data with missing values, corrupted values) classification problem, a robust data recovery model via Schatten-p norm is proposed to preprocessing the noisy data, where the rank of the data is implicitly decreased. To utilize the feature structure with constraints, an efficient feature learning algorithm via group lasso is proposed to handle features on arbitrary structure, whose convergence can be rigorously proved. To handle the problem of limited labeled data in image categorization/annotation tasks, efficient maximum consistency label propagation methods are proposed to improve the performance of graph-based semi-supervised learning methods, which utilizes both the labeled data information and graph manifold information. Extensive experiments indicate the good performance of proposed algorithms.
机译:如今,为了感知环境并理解人类行为,数据分析在处理来自不同领域的异构数据方面扮演着越来越重要的角色,例如图像分类/注释,客户细分,流量预测,广告优化,推荐系统,隐私分析等。大量的多元数据提出了数据挖掘的根本问题:如何发现隐藏在高维噪声观测中的有意义的紧凑模式?一种方法是进行降维,找到低维子空间,从而以低维结构编码数据。另一种方法是进行特征选择或特征工程,该特征选择或特征工程可操纵特征以捕获最有区别的模式以进行分类/聚类任务。为进一步改善低维嵌入效果,本文提出了一种迭代局部线性嵌入算法提出的方法,通过迭代更新嵌入来捕获非线性流形的整体结构。为了处理嘈杂的数据(例如缺少值的数据,损坏的值的数据)分类问题,提出了一种通过Schatten-p范数的鲁棒数据恢复模型来预处理嘈杂的数据,其中数据的等级会隐式降低。为了利用具有约束的特征结构,提出了一种基于组套索的有效特征学习算法来处理任意结构上的特征,可以证明其收敛性。为了解决图像分类/注释任务中标记数据有限的问题,提出了一种有效的最大一致性标签传播方法,以提高基于图的半监督学习方法的性能,该方法同时利用了标记数据信息和图流形信息。大量实验表明所提出算法的良好性能。

著录项

  • 作者

    Kong, Deguang.;

  • 作者单位

    The University of Texas at Arlington.;

  • 授予单位 The University of Texas at Arlington.;
  • 学科 Computer Science.;Engineering Computer.
  • 学位 Ph.D.
  • 年度 2013
  • 页码 139 p.
  • 总页数 139
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号