Gavin Leech

@gleech

regrantor

I cofounded the consultancy Arb, and am a fellow at Cosmos, CFI and Foresight.

gleech.org

$89,500 total balance
$89,500 charity balance
$0 cash balance

$0 in pending offers

About Me

I'm a regrantor most interested in technical AI alignment, control, and security projects. I'm also interested in funding macrostrategy and coordination (things like my Shallow Review of the field but better).

Projects

Shallow review of AI safety 2024
Giant Database of Replication Studies

Outgoing donations

Orexin Pilot Experiment for Reducing Sleep Need
$500
7 days ago
Virtue-Ethical Rationality and Training Dynamics
$10,000
7 days ago
Make ALERT happen
$5,000
over 1 year ago

Comments

Orexin Pilot Experiment for Reducing Sleep Need

Gavin Leech

7 days ago

strong team with nose in the game

Virtue-Ethical Rationality and Training Dynamics

Gavin Leech

8 days ago

Peli is a frugal philosopher who cowrote one of my favourite essays on alignment.

The idea: He wants to invent virtue post-training, inspired by e.g. the actual normative practice of fields like maths, which often appeals to self-instantiation. It's an open question whether current capabilities allow for instilling stable loops like this and I'm glad someone is trying.

Counterfactual: The current project struck out from the usual funders, I guess because theory is de-emphasised now, or because virtue ethics is usually unoperationalisable, or because he's not good at sales, or because they don't know the following about him.

Track record: Besides the (great) essay above: I've worked with Peli on two technical ML research papers and was impressed with his experimental skill, design skill, and precision. He also did some invited technical replication work on Turner 2023. He's competent at ML experiments at the level of a decent PhD student in the field. He is very used to working totally independently, from inception to completion.

Concerns: If I didn't know the above, the doc would worry me with how totally non-ML-technical it is (it's intentionally doing some original philosophical work as a prerequisite for that part). He's also been looking for collaborators and not finding much success; I hope that giving him independent funding makes this search an easier pitch.

I expect distribution to be the weakest part of the project. The inferential distance might be too high for the narrower part of the ingroup audience to bridge, even with him presenting good data. But places like PAW, AIES and HAAISS will certainly engage. He's capable of doing the conference-paper grind but doesn't seem motivated by it. Maybe a collaborator could bring the will to actively disseminate the results. But if distribution is the worst risk for a speculative, ambitious project, then we're in a good spot.

Cost-effectiveness: Very high, $40k / FTE for taking an intriguing idea and bringing it to testability.

Conflict of interest: As noted, Peli has worked on several projects at my company Arb and I've known him for years on Twitter.

Shallow review of AI safety 2024

Gavin Leech

25 days ago

@CarmenCondor my bad - here it is!

Shallow review of AI safety 2024

Gavin Leech

28 days ago

Final report

Description of subprojects and results, including major changes from the original proposal

The post went up roughly on time (29th December) and was fairly well-received (though it garnered less karma than last year). Comments were good and only the Alex Altair entry required notable edits. This is evidence but not strong evidence that the current version is error-free.

Our conference scrape surfaced some academic work that I think is underappreciated on LW, though less of it than I hoped.

I'm very happy with the new fields (target case and broad approach) and our data entry on them.

Change: Following comments from funders we didn't do the "glossy" PDF version. Surplus money will go towards the 2025 version.

Spending breakdown

100% on salaries for the team. Thanks especially to Shoshannah Tekofsky, a highly graceful research manager.

Shallow review of AI safety 2024

Gavin Leech

7 months ago

A donor has sent another $10k, which will partly fund the 2025 edition.

Shallow review of AI safety 2024

Gavin Leech

7 months ago

Thanks very much to all donors! A private donor has offered to fill the difference so please stop sending me money (mods, if there's a way to close projects I can't see it). We've started work.

Make ALERT happen

Gavin Leech

over 1 year ago

The predecessor was my most important project last year. I've personally verified that there's a great deal of demand for some version of this ("customer" orgs and institutions and "supplier" volunteers). Nuno has some rare and essential qualities (honesty, clarity, infovorism) while lacking some others. But the shoestring version still excites me and I vote with my feet.

Transactions

For | Date | Type | Amount
Orexin Pilot Experiment for Reducing Sleep Need | 7 days ago | project donation | -$500
Virtue-Ethical Rationality and Training Dynamics | 7 days ago | project donation | -$10,000
Manifund Bank | about 1 month ago | deposit | +$100,000
Manifund Bank | 6 months ago | withdraw | -$10,000
Shallow review of AI safety 2024 | 6 months ago | project donation | +$10,000
Manifund Bank | 7 months ago | withdraw | -$10,860
Shallow review of AI safety 2024 | 7 months ago | project donation | +$8,000
Shallow review of AI safety 2024 | 7 months ago | project donation | +$1,000
Shallow review of AI safety 2024 | 7 months ago | project donation | +$1,000
Shallow review of AI safety 2024 | 7 months ago | project donation | +$10
Shallow review of AI safety 2024 | 7 months ago | project donation | +$500
Shallow review of AI safety 2024 | 7 months ago | project donation | +$50
Shallow review of AI safety 2024 | 7 months ago | project donation | +$300
Make ALERT happen | over 1 year ago | project donation | -$5,000
Manifund Bank | over 1 year ago | deposit | +$5,000