首页> 外文会议>IEEE Data Science Workshop >Distance-Penalized Active Learning via Markov Decision Processes

【24h】

Distance-Penalized Active Learning via Markov Decision Processes

机译：马尔可夫决策过程的距离惩罚主动学习

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We consider the problem of active learning in the context of spatial sampling, where the measurements are obtained by a mobile sampling unit. The goal is to localize the change point of a one-dimensional threshold classifier while minimizing the total sampling time, a function of both the cost of sampling and the distance traveled. In this paper, we present a general framework for active learning by modeling the search problem as a Markov decision process. Using this framework, we present time-optimal algorithms for the spatial sampling problem when there is a uniform prior on the change point, a known non-uniform prior on the change point, and a need to return to the origin for intermittent battery recharging. We demonstrate through simulations that our proposed algorithms significantly outperform existing methods while maintaining a low computational cost.

机译：我们考虑在空间采样的背景下的主动学习的问题，其中通过移动采样单元获得测量。目标是本地化一维阈值分类器的变更点，同时最小化总采样时间，采样成本和行驶距离的函数。在本文中，我们通过将搜索问题建模为Markov决策过程，提出了一个用于主动学习的一般框架。使用该框架，我们在改变点之前存在均匀时，我们为空间采样问题提供时间最佳算法，在改变点上已知的不均匀，并且需要返回到间歇电池充电的原点。我们通过模拟展示我们所提出的算法显着优于现有方法，同时保持低计算成本。

著录项

来源
《IEEE Data Science Workshop》|2019年|155-159|共5页
会议地点
作者
Dingyu Wang; John Lipor; Gautam Dasarathy;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
learning (artificial intelligence); Markov processes; pattern classification; search problems;

机译：学习（人工智能）;马尔可夫过程;模式分类;搜索问题;

相似文献

外文文献
中文文献
专利

1. Policy learning in continuous-time Markov decision processes using Gaussian Processes [J] . Bartocci Ezio, Bortolussi Luca, Brazdil Tomas, Performance Evaluation . 2017,第nova期

机译：使用高斯过程的连续时间马尔可夫决策过程中的策略学习
2. Active Model Estimation in Markov Decision Processes [J] . Jean Tarbouriech, Shubhanshu Shekhar, Matteo Pirotta, JMLR: Workshop and Conference Proceedings . 2020,第2010期

机译：马尔可夫决策过程中的主动模型估计
3. An active-set strategy to solve Markov decision processes with good-deal risk measure [J] . Tu Shu, Defourny Boris Optimization Letters . 2019,第6期

机译：解决Markov决策过程的主动集策略，具有良好的风险衡量
4. Distance-Penalized Active Learning via Markov Decision Processes [C] . Dingyu Wang, John Lipor, Gautam Dasarathy IEEE Data Science Workshop . 2019

机译：通过马尔可夫决策过程距离惩罚的主动学习
5. Information Theoretic Learning Methods for Markov Decision Processes With Parametric Uncertainty [D] . Kumar, Peeyush 2018

机译：参数不确定马尔可夫决策过程的信息理论学习方法
6. Learning to maximize reward rate: a model based on semi-Markov decision processes [O] . Arash Khodadadi, Pegah Fakhari, Jerome R. Busemeyer 2014

机译：学习最大化奖励率：基于半马尔可夫决策过程的模型
7. Active Learning of Markov Decision Processes for System Verification [O] . Chen, Yingke, Nielsen, Thomas Dyhre 2012

机译：系统验证马尔可夫决策过程的主动学习

Distance-Penalized Active Learning via Markov Decision Processes

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅