Contextual Bandit with Adaptive Feature Extraction

机译：具有自适应特征提取的上下文强盗

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We consider an online decision making setting known as contextual bandit problem, and propose an approach for improving contextual bandit performance by using an adaptive feature extraction (representation learning) based on online clustering. Our approach starts with an off-line pre-training on unlabeled history of contexts (which can be exploited by our approach, but not by the standard contextual bandit), followed by an online selection and adaptation of encoders. Specifically, given an input sample (context), the proposed approach selects the most appropriate encoding function to extract a feature vector which becomes an input for a contextual bandit, and updates both the bandit and the encoding function based on the context and on the feedback (reward). Our experiments on a variety of datasets, and both in stationary and non-stationary environments of several kinds demonstrate clear advantages of the proposed adaptive representation learning over the standard contextual bandit based on "raw" input contexts.

机译：我们考虑一个称为上下文强盗问题的在线决策环境，并提出一种通过使用基于在线聚类的自适应特征提取（表示学习）来提高上下文强盗性能的方法。我们的方法开始于对未标记的上下文历史进行离线预训练（可以通过我们的方法来利用，但不能通过标准的上下文匪徒利用），然后是在线选择和修改编码器。具体而言，在给定输入样本（上下文）的情况下，建议的方法选择最合适的编码函数来提取特征向量，该特征向量成为上下文匪徒的输入，并根据上下文和反馈更新匪徒和编码函数（报酬）。我们在各种数据集上进行的实验，以及在几种固定和非固定环境下的实验，都证明了与基于“原始”输入上下文的标准上下文强盗相比，所提出的自适应表示学习具有明显的优势。

著录项

来源
《IEEE International Conference on Data Mining Workshops》|2018年|937-944|共8页
会议地点
作者
Baihan Lin; Djallel Bouneffouf; Guillermo A. Cecchi; Irina Rish;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
History; Decision making; Encoding; Clustering algorithms; Feature extraction; Context modeling; Standards;

机译：历史;决策;编码;聚类算法;特征提取;上下文建模;标准;

相似文献

外文文献
中文文献
专利

1. Adaptive Spectral–Spatial Multiscale Contextual Feature Extraction for Hyperspectral Image Classification [J] . Wang Di, Du Bo, Zhang Liangpei, IEEE Transactions on Geoscience and Remote Sensing . 2021,第3期

机译：高光谱图像分类的自适应光谱 - 空间多尺度上下文特征提取
2. Adaptive metamorphic testing with contextual bandits [J] . Helge Spieker, Arnaud Gotlieb The Journal of Systems and Software . 2020,第Jula期

机译：具有语境匪徒的自适应变质测试
3. Contextual Bandits with Continuous Actions: Smoothing, Zooming, and Adapting [J] . Akshay Krishnamurthy, John Langford, Aleksandrs Slivkins, Journal of machine learning research . 2020,第a期

机译：具有连续动作的上下文匪徒：平滑，缩放和调整
4. Contextual Bandit with Adaptive Feature Extraction [C] . Baihan Lin, Djallel Bouneffouf, Guillermo A. Cecchi, IEEE International Conference on Data Mining Workshops . 2018

机译：具有自适应特征提取的上下文匪
5. Adaptive Preference Learning with Bandit Feedback: Information Filtering, Dueling Bandits and Incentivizing Exploration [D] . Chen, Bangrui. 2017

机译：带有土匪反馈的自适应偏好学习：信息过滤，决斗土匪和激励探索
6. Action Centered Contextual Bandits [O] . Kristjan Greenewald, Ambuj Tewari, Predrag Klasnja, -1

机译：行动为中心的情境强盗
7. Adaptive Keywords Extraction with Contextual Bandits for Advertising on Parked Domains [O] . Yuan, Shuai, Wang, Jun, van der Meer, Maurice 2013

机译：基于语境匪的自适应关键词提取停放的域名

Contextual Bandit with Adaptive Feature Extraction

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅