Abstract: Reinforcement Learning (RL) agents optimize policies based on provided rewards, yet may exploit unintended loopholes in the reward design, a phenomenon known as reward hacking. With the rise ...
Abstract: Formalization of intended requirements is indispensable when using formal methods in software development. However, translating Natural Language (NL) requirements into formal specifications, ...