Mitigating Reward Hacking Through RL Training Interventions | Manifund