What this grant produced
This grant, in combination with my other Manifund grant, funded my transition into AI safety research and led to the formation of Lyptus Research, a not-for-profit AI safety research group based in Australia.
The published output during the grant period was our research note on offensive cyber time horizons. which extended METR's time-horizon methodology to offensive cybersecurity, grounded in a human expert study with 10 professional security practitioners.
Earlier in the grant period I also explored attack selection and monitor capability evaluation. That direction did not eventuate into a finished product.
Lyptus Research continues to work on dangerous capability evaluation using similar methods, and on technical research into alignment.
Cost breakdown
Across the two grants, approximately $70,000 USD supported the published cyber work
- Salaries: ~$50,000
- API credits: ~$10,000
- Contractors (study expert participants): ~$10,000
- Infrastructure, tools, and ops: ~$5,000
A further ~$20,000 USD supported my earlier transition into the field
- About 1.5 to 2 months of independent AI control research
- Australian domestic travel
- A couple of conferences in Australia
The project ran longer than originally scoped, and the shortfall was covered through our subsequent bridge funding grant.