Gaussian Process Q-Learning for Finite-Horizon Markov Decision Process

2025

  • Maximilian Bloor, Tom Savage, Calvin Tsay, Antonio del Rio Chanona, Max Mowbray, Gaussian Process Q-Learning for Finite-Horizon Markov Decision Process, Reinforcement Learning Conference, 2025