[MRR+26]
Anirban Majumdar, Ritam Raha, Rajarshi Roy, David Parker and Marta Kwiatkowska.
About Time: Model-free Reinforcement Learning with Timed Reward Machines.
In Proc. 35th International Joint Conference on Artificial Intelligence (IJCAI'26).
2026.
Notes:
An extended version is available at https://arxiv.org/abs/2512.17637.