arXiv cs.LG
· Papers
Exploration and Online Transfer with Behavioral Foundation Models
arXiv:2606.29980v2 Announce Type: replace-cross Abstract: Zero-shot Transfer in Reinforcement Learning (RL) aims to train an agent that can generate optimal policies for any reward function, without additional learning at transfer time, while training only on reward-free trajectories. For their generality over tasks, s