Benchmarking Deceptive Improvement in Agentic Systems | Manifund