Oxford logo
[MRR+26] Anirban Majumdar, Ritam Raha, Rajarshi Roy, David Parker and Marta Kwiatkowska. About Time: Model-free Reinforcement Learning with Timed Reward Machines. Technical report 2512.17637, arxiv. 2026. [pdf] https://arxiv.org/abs/2512.17637
Downloads:  pdf pdf (11.63 MB)

QAV:

Home

People

Projects

Publications