Manifund foxManifund
Home
Login
About
People
Categories
Newsletter
HomeAboutPeopleCategoriesLoginCreate
header image

Technical AI safety

10 proposals
66 active projects
$3.79M
Grants186Impact certificates20
Anthony avatar

Anthony Duong

1-month full-time contributing software to Inspect

Technical AI safety
1
1
$10K raised
🦑

David Chanin

Support for SAELens and other Decode Research Projects

Technical AI safety
2
1
$6.8K raised
RCS-architect avatar

Arifa Khan

Preventing AI Catastrophe Through Economic Mechanisms

The Reputation Circulation Standard - Implementation Sprint

Science & technologyTechnical AI safetyAI governanceGlobal catastrophic risks
1
0
$0 / $50K
🥦

Jord Nguyen

Benchmarking and comparing different evaluation awareness metrics

LLMs often know when they are being evaluated. We’ll do a study comparing various methods to measure and monitor this capability.

Technical AI safetyAI governance
1
0
$0 / $15K
🐸

Itay Yona

Cultivating a Mechanistic Interpretability Community

Sustaining and Scaling a Grassroots Research Collective for Neural Network Interpretability and Control

Science & technologyTechnical AI safetyEA community
2
1
$0 / $20K
🐷

H

Funding researcher on frontier AI governance inclusive of global majority

Technical AI safetyAI governance
0
1
$87.5K raised
skunnavakkam avatar

Sudarsh Kunnavakkam

Tooling + Model Orgs for CoT Faithfulness Research

Building model organisms of CoT and Python packages for intervention in reasoning traces

Technical AI safety
1
3
$3K raised
🐸

Belinda Mo

Silicon Valley: the Musical

A comedy that gets people thinking about AI in society

Technical AI safetyAI governanceEA community
3
2
$150 / $20K
🐧

Bryce Meyer

TransformerLens - Bridge Funding

Science & technologyTechnical AI safety
6
4
$20K raised
los_angeleno1176 avatar

Kristina Vaia

AI Safety Los Angeles (AISLA)

The official AI safety community in Los Angeles

Technical AI safetyAI governanceGlobal catastrophic risks
2
7
$2.5K / $15K
🦀

Chi Nguyen

Acausal research and interventions

Making sure AI systems don't mess up acausal interactions

Technical AI safetyGlobal catastrophic risks
7
3
$70K raised
Apart avatar

Apart Research

Keep Apart Research Going: Global AI Safety Research & Talent Pipeline

Funding ends June 2025: Urgent support for proven AI safety pipeline converting technical talent from 26+ countries into published contributors

Technical AI safetyAI governanceEA community
32
39
$131K raised
🐠

Sarah Wiegreffe

The First Actionable Interpretability Workshop at ICML 2025

https://actionable-interpretability.github.io/

Science & technologyTechnical AI safety
1
1
$1.5K raised
Reese avatar

Igor Ivanov

Demonstration of LLMs deceiving and getting out of a sandbox

Technical AI safety
2
2
$3.11K raised
Asterisk-Magazine avatar

Asterisk Magazine

Asterisk AI Blogging Fellowship

Technical AI safetyAI governanceGlobal catastrophic risks
6
2
$70K raised
Connoraxiotes avatar

Connor Axiotes

'Making God': a Documentary on AI Risks for the Public

Geoffrey Hinton & Yoshua Bengio Interviews Secured, Funding Still Needed

Science & technologyTechnical AI safetyAI governanceGlobal catastrophic risks
16
35
$205K raised
🥥

tamar rott shaham

The First Workshop on Mechanistic Interpretability for Vision

Science & technologyTechnical AI safety
1
2
$1.5K raised
Jimmm avatar

Jim Maar

Implicit planning in LLMs Paper

Reproducing the Claude poetry planning results quantitatively

Technical AI safety
1
1
$1K raised

Complete Projects

StevePetersen avatar

Steve Petersen

Systems that "give a damn"

Teleology, agential risks, and AI well-being

Technical AI safetyAnimal welfare
5
5
$12K raised

Unfunded Projects

JaesonB avatar

Jaeson Booker

The AI Safety Research Fund

Creating a fund exclusively focused on supporting AI Safety Research

Technical AI safety
0
16
$0 raised