Caleb DeLeeuw
Tests whether SAE features survive NL compression. v0.1 result: EmergenceIndex 0.601. Scaling to broader models and adversarial robustness testing.
Nada Amin
Building LemmaScript, a verification toolchain for TypeScript
Daniel Culotta
thomascederborgsemail
Noa Hölzer
Runway for the core staff to design the program, find mentors and get funding
Alex Wolf
Mirror is a programming language written BY AI FOR AI and written FOR HUMANS BY HUMANS.
shivam dubey
Mapping the attention heads that push LLMs toward refusal vs. compliance, and building an inference-time defense against both single- and multi-turn jailbreaks.
Godwin Abuh Faruna
Safety steering that actually works off English
Petr Lebedev
A Veritasium for AI Safety.
Anju Chhetri
Kay Astle
Translating formal verification into physical gravity-biased safety interlocks. We have a working prototype; funding builds 10-15 MIL-SPEC Beta units for red-te
Nika Novak
Miguel Fernandez
A working Rust engine + in-browser verifier that turns any AI decision into a tamper-evident, independently checkable cryptographic receipt
Ahmed
A fast, comprehensive directory of the people and orgs in AI safety: search, filter, and match.
John Greer
Kevin Yandoka Denamganai
Compositional Learning Behaviours as a Necessary Condition for Olympiad-Level Formal Theorem Proving
Aashka Patel
AI Nutrition Labels For Everyday Consumers & AI Agents: Travel + POC Grant
Ryan Ingosi
An independent safety score for AI agents you can verify — deterministic, reproducible, auditable, and it never needs your private data.
Pedro Bentancour Garin
One year of bootstrapped development, four patent filings, seeking support to continue.