首页> 外文学位 >Conformal predictions in multimedia pattern recognition.
【24h】

Conformal predictions in multimedia pattern recognition.

机译:多媒体模式识别中的保形预测。

获取原文
获取原文并翻译 | 示例

摘要

The fields of pattern recognition and machine learning are on a fundamental quest to design systems that can learn the way humans do. One important aspect of human intelligence that has so far not been given sufficient attention is the capability of humans to express when they are certain about a decision, or when they are not. Machine learning techniques today are not yet fully equipped to be trusted with this critical task. This work seeks to address this fundamental knowledge gap. Existing approaches that provide a measure of confidence on a prediction such as learning algorithms based on the Bayesian theory or the Probably Approximately Correct theory require strong assumptions or often produce results that are not practical or reliable. The recently developed Conformal Predictions (CP) framework - which is based on the principles of hypothesis testing, transductive inference and algorithmic randomness - provides a game-theoretic approach to the estimation of confidence with several desirable properties such as online calibration and generalizability to all classification and regression methods.;This dissertation builds on the CP theory to compute reliable confidence measures that aid decision-making in real-world problems through: (i) Development of a methodology for learning a kernel function (or distance metric) for optimal and accurate conformal predictors; (ii) Validation of the calibration properties of the CP framework when applied to multi-classifier (or multi-regressor) fusion; and (iii) Development of a methodology to extend the CP framework to continuous learning, by using the framework for online active learning. These contributions are validated on four real-world problems from the domains of healthcare and assistive technologies: two classification-based applications (risk prediction in cardiac decision support and multimodal person recognition), and two regression-based applications (head pose estimation and saliency prediction in images). The results obtained show that: (i) multiple kernel learning can effectively increase efficiency in the CP framework; (ii) quantile p-value combination methods provide a viable solution for fusion in the CP framework; and (iii) eigendecomposition of p-value difference matrices can serve as effective measures for online active learning; demonstrating promise and potential in using these contributions in multimedia pattern recognition problems in real-world settings.
机译:模式识别和机器学习领域是设计可学习人类行为的系统的基本要求。迄今为止尚未得到足够重视的人类智能的一个重要方面是人类在确定某个决定或不确定时表达的能力。当今的机器学习技术尚未完全具备可胜任这项关键任务的能力。这项工作旨在解决这一基本知识鸿沟。现有的提供对预测的置信度的方法,例如基于贝叶斯理论或“可能近似正确”理论的学习算法,需要强有力的假设,或者经常产生不实际或不可靠的结果。最近开发的共形预测(CP)框架-基于假设检验,转导推断和算法随机性的原理-提供了一种博弈论方法来评估置信度,并具有一些理想的属性,例如在线校准和可归纳到所有分类本论文建立在CP理论的基础上,通过以下方法来计算可靠的置信度,以帮助在现实世界中进行决策:(i)开发一种学习核函数(或距离度量)的方法,以实现最优和准确适形预测因子(ii)在应用于多分类器(或多回归)融合时验证CP框架的校准特性; (iii)通过使用在线主动学习框架,开发一种将CP框架扩展到持续学习的方法。这些贡献已在医疗保健和辅助技术领域的四个现实世界问题上得到验证:两个基于分类的应用程序(心脏决策支持和多模式人识别中的风险预测)和两个基于回归的应用程序(头部姿势估计和显着性预测)在图像中)。获得的结果表明:(i)多核学习可以有效地提高CP框架的效率; (ii)分位数p值组合方法为CP框架中的融合提供了可行的解决方案; (iii)p值差异矩阵的特征分解可作为在线主动学习的有效措施;展示了在现实环境中将这些贡献用于多媒体模式识别问题的前景和潜力。

著录项

  • 作者单位

    Arizona State University.;

  • 授予单位 Arizona State University.;
  • 学科 Information Science.;Computer Science.;Artificial Intelligence.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 269 p.
  • 总页数 269
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号