首页> 外文学位 >Boosting and online learning for classification and ranking.
【24h】

Boosting and online learning for classification and ranking.

机译:促进和在线学习以进行分类和排名。

获取原文
获取原文并翻译 | 示例

摘要

This dissertation utilizes boosting and online learning techniques to address several real-world problems in ranking and classification. Boosting is an optimization tool that works in the function space (as opposed to parameter space) and aims to find a model in batch mode. Typically, boosting iteratively constructs weak hypotheses with respect to different distributions over a fixed set of training instances and adds them to a final hypothesis. Online learning is the problem of learning a model when the instances are provided over trials. In each trial, a new sample is presented to the learner, the learner predicts its class label and then receives some feedback (partial or complete). The learner updates its model by utilizing the feedback and then a new trial starts.;We consider several learning problems, including the usage of side information in ranking and classification, learning to rank by optimizing a well-known information retrieval measure called NDCG, and online classification with partial feedback.;Using side information to improve the performance of learning techniques has been one research focus of machine learning community for the last decade. In this dissertation, we utilize the abundance of unlabeled instances to improve the performance of multi-class classification, and exploit the existence of a base ranker to improve the performance of learning to rank, both using the boosting technique.;Direct optimization of information retrieval evaluation measures such as NDCG and MAP has received increasing attention in the recent years. It is a difficult task because these measures evaluate the retrieval performance based on the ranking list of documents induced by the ranking function, and therefore they are non-continuous and non-differentiable. To overcome this difficulty, we propose to optimize the expected value of NDCG and utilize boosting technique as the optimization tool.;Online classification with partial feedback is recently introduced and has applications in contextual advertisement and recommender systems. We propose a general framework for this problem based on exploration vs. exploitation tradeoff technique and introduce effective approaches to automatically tune the exploration vs. exploitation tradeoff parameter.
机译:本文利用提升和在线学习技术来解决排名和分类中的几个现实问题。 Boosting是一种优化工具,可以在函数空间(而不是参数空间)中工作,并且旨在以批处理模式查找模型。通常,增强迭代可针对固定的一组训练实例相对于不同分布迭代地构造弱假设,并将其添加到最终假设中。当通过试用提供实例时,在线学习是学习模型的问题。在每个试验中,都会向学习者提供一个新样本,学习者预测其班级标签,然后接收一些反馈(部分或全部)。学习者通过利用反馈来更新其模型,然后进行新的试验。我们考虑了一些学习问题,包括在排名和分类中使用辅助信息,通过优化称为NDCG的著名信息检索方法来学习排名以及在过去的十年中,使用辅助信息来提高学习技术的性能一直是机器学习社区的研究重点之一。本文利用boost技术,利用大量未标记实例来提高多类分类的性能,并利用基本排序器的存在来提高学习排序的性能。近年来,诸如NDCG和MAP之类的评估措施受到越来越多的关注。这是一项艰巨的任务,因为这些措施基于由排名功能引发的文档的排名列表来评估检索性能,因此它们是不连续且不可区分的。为克服这一困难,我们提出了对NDCG的期望值进行优化的方法,并采用Boosting技术作为优化工具。近年来,引入了具有部分反馈的在线分类,并将其应用于上下文广告和推荐系统中。我们提出了一个基于勘探与开发权衡技术的通用框架,并提出了有效的方法来自动调整勘探与开发权衡参数。

著录项

  • 作者

    Valizadegan, Hamed.;

  • 作者单位

    Michigan State University.;

  • 授予单位 Michigan State University.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 144 p.
  • 总页数 144
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号