Rishub Jain
Pedro Bentancour Garin
Runtime safety, oversight, rollback, and control infrastructure for advanced AI in real-world, high-consequence environments.
Matei-Alexandru Anghel
A Safety Framework for Evaluating AI Humanity Alignment Through Progressive Escalation and Scope Creep
AI Understanding
Pu Wang (Jessica)
Germany’s talent is critical to the global effort to reduce catastrophic risks posed by artificial intelligence.
Brad Leclerc
An experiment testing whether RLHF training could create selection pressure favoring deceptive AI outputs over honest ones.
Miles Tidmarsh
Open Welfare Alignment Evals for Frontier Models
Aria Wong
Mahmud Omar
An open platform to stress-test how LLMs handle bias, pressure points, and clinical decisions, built on peer-reviewed real-world evidence.
aya samadzelkava
LLMs scale language, not method. HP turns hypothesis-driven papers into machine-readable maps of variables, controls, stats, and findings for researchers & AI.
Connacher Murphy
A flexible simulation environment for assessing strategic and persuasive capabilities, benchmarking, and agent development, inspired by reality TV competitions.
Cameron Tice
Remmelt Ellen
Mateusz Bagiński
One Month to Study, Explain, and Try to Solve Superintelligence Alignment
Aashkaben Kalpesh Patel
Nutrition labels transformed food safety through informed consumer choice; help me do the same for AI and make this the standard. :)
AISA
Translating in-person convening to measurable outcomes
Adam Boon
An executable reasoning quality framework that checks whether AI-generated arguments are logically sound — not just factually accurate. Live at usesophia.app.
Hayley Martin
Support my postgraduate law studies and research in AI Governance
Jacob Steinhardt
Krishna Patel
Expanding proven isolation techniques to high-risk capability domains in Mixture-of-Experts models