Emmanuel king
A production-ready mathematical governance layer (C+R+S simplex, Control Barrier Functions, Lyapunov stability) that enforces Continuity, Reciprocity & Sovereig
Ahmed
Every person in AI safety as a face on a world map, auto-filled by a scraper and matched to the labs, mentors and funders looking for them.
Vakeesan Mahalingam
Open-source safety infrastructure for budget caps, verification, rollback, and audit trails in autonomous coding agents.
Eric Moore
Cryptographic attestation, runtime conscience, and an unfilterable kill switch. Live on the App Store and Google Play in 29 languages. AGPL, mission-locked
Naufal Ridwan
Building the technical foundations needed to formalize and computationally explore adaptive constraints.
Abhishek Mishra
Assessing whether Persona drift be detected in conditionally misaligned models using the assistant axis and potentially analyzing the reasons for failure
Saurav Panigrahi
Accepted ICML 2026 workshop paper on cross-constitution drift in LLMs; seeking $2,050 travel support to present in Seoul and gather research feedback.
Conor Plunkett
Benchmark for agent safety when spending users money. How often do they violate user intent and rules?
Zackery Sayers
Productionizing Mnemosyne, a local inspectable memory layer for LLM agents, plus an open benchmark for how memory and autonomy change AI-agent safety.
Shaun Srirangam
Tutankhamun Castillo El-Bey
Cryptographically signed, independently verifiable receipts for what AI agents actually did, anchored to Bitcoin so the record can't be quietly rewritten.
Mark McArthey
After 10 months running as my daily companion, this persistent AI is exposing failure modes in memory contamination, confabulation, and emotional drift.
Alex Chao
Funding AI credits and expert review for faith-facing AI evaluation projects
Amrutha M
Seeking support to build a prototype AI model drift monitoring platform and validate it with investors and early adopters.
Jake Prokopets
A contamination-free benchmark for measuring whether LLMs can forecast, or whether they're just remembering.
A three-month interpretability project: a clean toy model of computation in superposition via constrained attention, isolating the MLP. Output: a paper.
Yuchen Liu
Independent collective. Φ-Arena open benchmark, 3 ICLR 2027 papers (Φ-Arena, mechinterp, energy-bounded) — kickstart for a 10-year program.
Dr. Smadar Itskovich
A Research Agenda for Sovereign Capability
Camila Blank
Jash Vira
Does harmful fine-tuning data cause broad misalignment only when the model already recognises the target behaviour as a norm violation?