2 months full-time contributing software to Inspect

Technical AI safety
Anthony Duong

Active grant
$20,000 raised
$20,000 funding goal
Fully funded and not currently accepting donations.

Project summary

2 months full-time contributing software to Inspect.

What are this project's goals? How will you achieve them?

Goals:

  • Make improvements to Inspect.

  • Add more evals to Inspect.

  • Test my fit as a software engineer for evals.

  • Build career capital.

How I'll achieve them:

  • Concrete ways:

    • Port benchmarks not yet in Inspect (e.g. TheAgentCompany, RE-Bench, and MLGym).

    • Develop Python packages implementing collections of Inspect solvers, tools, and scorers (e.g. Inspect Cyber).

    • Implement realistic test environments like WebArena for testing a wider range of agent scenarios in contained settings.

    • Build tools for analyzing log files/reviewing transcripts to identify reasons for failure (if Docent isn’t doing all of this).

    • Build tools for presenting collections of results in dashboards (i.e. contribute to https://github.com/ArcadiaImpact/inspect_evals_dashboard).

    • Build tools for LM agents to use (e.g. search through https://github.com/aorwall/moatless-tools for tools that might be useful and build them in Inspect).

  • Default ways/in general:

    • Try to complete open issues in Inspect repos.

    • Ask the developers in the Inspect Slack workspace how to contribute.

How will this funding be used?

This is meant to replace as much of my industry salary as possible (full replacement would mean about $15,000 per month).

Who is on your team? What's your track record on similar projects?

Just me. I maintain open source projects like SAELens, Neuronpedia, and SAEDashboard.

What are the most likely causes and outcomes if this project fails?

Causes:

  • I don't:

    • Ramp up on the codebase fast enough.

    • Have enough work for 1 month full-time.

Outcomes:

  • I don't:

    • Make any significant improvements to Inspect.

    • Add many evals to Inspect.

    • Know my fit as a software engineer for evals.

    • Build career capital.

How much money have you raised in the last 12 months, and from where?

None.
