Nicholas Volta
Offline AI agents that help agencies analyze data and be compliant with data protocols, all while investigating cybercrime to sex offender violations.
Nada Amin
Building LemmaScript, a verification toolchain for TypeScript
Oliver Klingefjord
A publication about the institutions we need for powerful AI.
Caleb DeLeeuw
118 evals run across 3 NLA systems. EmergenceIndex 0.601 on Gemma-4-E2B NLA. Requesting $110,500 for 6-month benchmark program (LTFF-equivalent ask).
shivam dubey
Mapping the attention heads that push LLMs toward refusal vs. compliance, and building an inference-time defense against both single- and multi-turn jailbreaks.
Alex Wolf
Mirror is a programming language written BY AI FOR AI and written FOR HUMANS BY HUMANS.
Anju Chhetri
Godwin Abuh Faruna
Safety steering that actually works off English
A study to empirically study the depolarization effect of our values elicitation method
Gaetan Duchateau
Can a Distributed Sensorimotor System Reconstruct Without External Stimulus?
Yingnan Hao
This motion affects everything. Since only gravity can move all objects, this anomaly could unlock the secret to artificial gravity control.
Nika Novak
Ahmed
A fast, comprehensive directory of the people and orgs in AI safety: search, filter, and match.
John Greer
Kevin Yandoka Denamganai
Compositional Learning Behaviours as a Necessary Condition for Olympiad-Level Formal Theorem Proving
David Wood
The Longevity Escape Velocity Foundation (LEVF) seeks support for one of the most neglected and potentially important questions in aging research
Ryan Ingosi
An independent safety score for AI agents you can verify — deterministic, reproducible, auditable, and it never needs your private data.
Conor Plunkett
Benchmark for agent safety when spending users money. How often do they violate user intent and rules?