PAPER
Publications / 2025
No-Regret Thompson Sampling for Finite-Horizon Markov Decision Processes with Gaussian Processes
Jasmine Bayrooti, Sattar Vakili, Amanda Prorok, Carl Henrik Ek
NeurIPS 2025 · December 2025
Abstract
A no-regret Thompson sampling algorithm for finite-horizon Markov decision processes using Gaussian process models, providing efficient exploration in model-based reinforcement learning settings.
Paper