I decided not to fund this, but accepted Matthew into MATS instead.
@RyanKidd
Co-Director at MATS, LISA
https://www.linkedin.com/in/ryan-kidd-1b0574a3/
$4,000 in pending offers
Currently growing the AI alignment research field at ML Alignment & Theory Scholars Program (MATS) and the London Initiative for Safe AI (LISA). Previously, I completed a PhD in Physics at the University of Queensland and ran an Effective Altruism student group for ~3 years.
My ethics are largely preference utilitarian and cosmopolitan. I'm deeply concerned about near-term x-risk and safeguarding the long-term future. I see Manifund as an opportunity to fund public benefit research into high-impact cause areas that lack adequate incentive mechanisms.
My grantmaking priorities include:
Neglected AI safety interventions with a good safety/capabilities trade-off (i.e., R&D that might not be near-term commercially beneficial, but "raises the AI safety waterline");
Democratic governance mechanisms for monitoring and regulating dangerous "black-ball" technology, especially in AI and synthetic biology;
Building "defence in depth" and "civilizational resilience" against systemic risk factors, especially with regard to AI accident/misuse, pandemics, great power war, and totalitarianism.
In addition to general grantmaking, I have requests for proposals in the following areas:
Funding for AI safety PhDs (e.g., with these supervisors), particularly in exploratory research connecting AI theory with empirical ML research.
An AI safety PhD advisory service that helps prospective PhD students choose a supervisor and topic (similar to Effective Thesis, but specialized for AI safety).
Initiatives to critically examine current AI safety macrostrategy (e.g., as articulated by Holden Karnofsky) like the Open Philanthropy AI Worldviews Contest and Future Fund Worldview Prize.
Initiatives to identify and develop "Connectors" outside of academia (e.g., a reboot of the Refine program, well-scoped contests, long-term mentoring and peer-support programs).
Physical community spaces for AI safety in AI hubs outside of the SF Bay Area or London (e.g., Japan, France, Bangalore).
Start-up incubators for projects, including for-profit evals/red-teaming/interp companies, that aim to benefit AI safety, like Catalyze Impact, Future of Life Foundation, and YCombinator's request for Explainable AI start-ups.
Initiatives to develop and publish expert consensus on AI safety macrostrategy cruxes, such as the Existential Persuasion Tournament and 2023 Expert Survey on Progress in AI (e.g., via the Delphi method, interviews, surveys, etc.).
New nonprofit startups that aim to benefit AI safety.
Ryan Kidd
27 days ago
I regranted an additional $2k to let the organizers launch the basic event, as per Grace's comment.
Ryan Kidd
about 1 month ago
Update: we recently published a blog post summarizing our takes on talent needs of technical AI safety teams based on 31 interviews with key figures in AI safety, including senior researchers, organization leaders, social scientists, strategists, funders, and policy experts.
Ryan Kidd
about 1 month ago
Update: we recently published our Winter 2023-24 Retrospective: https://www.lesswrong.com/posts/Z87fSrxQb4yLXKcTk/mats-winter-2023-24-retrospective
Ryan Kidd
about 2 months ago
Update: MATS is no longer in need of additional funding for our Summer 2024 Program. We are still accepting donations towards our Winter 2024-25 Program, however!
Ryan Kidd
about 2 months ago
I'll likely regrant to this project because I think CAIS is great, but I'll first look for projects where my grants funge less with Open Phil, SFF, Longview, LTFF, etc.
Ryan Kidd
about 2 months ago
Dan Hendrycks' AI safety textbook seems great, but principally serves as an introduction to the field, rather than an in-depth overview of current technical AI safety research directions, which is the intent of this project. Periodically updated "topics courses" could serve as an equivalent source of value, but these might be bound to particular universities and updatable on a slower timescale than an online textbook. I'm also enthused by Markov's plans to eventually integrate interactive content and live content from sources like Metaculus, Our World in Data, Stampy, and more.
I believe that the AI safety research field should grow 10-100x over the next 10-20 years and AI safety student groups should be a strong driver of this growth. Currently, I think AI safety student groups need more "plug-and-play" curricula to best prepare members for progression into research, engineering, and policy roles, especially at universities without dedicated AI safety courses such as the one based on Hendrycks' textbook. I think BlueDot Impact's AI Safety Fundamentals courses are great, but I don't see why BlueDot and CAIS should be the only players in this space, and I think there is some benefit from healthy competition/collaboration.
Charbel has road-tested content from the early stages of this textbook project with several AI safety university groups and courses with apparently good feedback.
I've been impressed with Charbel's LessWrong posts and nuanced takes on AI safety research agendas.
The online version of the textbook will be free and open-source (MIT License), which I think is important for introductory AI safety fieldbuilding materials to be maximally impactful.
I think that the optimal form of this project is a continually updated online resource that periodically integrates new papers and research paradigms, so this project will eventually need long-term funding and a permanent home. However, I believe that my grant will greatly assist Charbel and Markov in producing a proof-of-concept sufficient to secure long-term funding or institutional support. Additionally, the textbook MVP seems likely to be high-value in the near term regardless of whether the project continues. Lastly, if the textbook is high-value and Charbel and Markov are unable to secure long-term funding, I'm sure it will be useful for established curriculum developers like BlueDot Impact.
I wonder if this project should actually be converted into a wiki once the MVP is developed. Markov has previously worked with Stampy and has mentioned that they might want to integrate some Stampy articles into the online textbook. However, even if this project is ideally a wiki, building a viable MVP seems crucial to securing long-term funding and core content for iterating upon.
I don't know Markov or the proposed editor, Professor Vincent Corruble, very well, which slightly decreases my confidence in the textbook quality. However, Markov comes highly recommended by Charbel, previously worked at Rational Animations in charge of AI safety content, and has produced good-according-to-me content for the textbook so far. Professor Corruble is an Associate Professor at Sorbonne Université and a UC Berkeley CHAI affiliate, which indicates he has the technical expertise to oversee the computer science aspects of the textbook. I additionally recommend that Charbel and Markov enlist the support of further editors with experience in AI safety strategy and AI governance, as I believe these are critical aspects of the textbook.
I chose to donate enough to fund the minimum amount for this project to proceed because:
I want full-time work on this textbook to commence immediately to minimize its time-to-impact and Markov is unable to do this until he receives confirmed funding for 6 months;
I think it is relatively unlikely that this project will be funded by Open Philanthropy or the LTFF and I have greater risk tolerance for projects like this;
I have 5x the grant budget I had last year, and based on the projects I funded then, I think this project is probably more impactful than the bar I would have set for a counterfactual $7.8k regrant last year;
I didn't give more than the minimum amount as I feel my marginal funding is high-value for other projects and I think Charbel and Markov can likely secure additional funding from other sources (including other Manifund regrantors) if necessary.
I don't believe there are any conflicts of interest to declare.
Ryan Kidd
2 months ago
Update: thanks to your donations, we were able to support an additional 8.5 scholars in the Winter 2023-24 Program, at an ex post cost of $22.4k/scholar! Thank you so much for your contributions to the field of AI safety :)
We are currently fundraising for our Summer 2024 Program and again expect to receive less funding than our ideal program. We can support marginal scholars at a cost of $24.4k/scholar. We currently have 1220 applicants for Summer 2024 and expect to accept ~3-5% (i.e., MIT's admissions rate). Given the high calibre of applicants and mentors, we would love further funding to support additional scholars!
We have announced the following mentors and hope to announce more as we confirm additional funding: https://docs.google.com/document/d/1sDnD9Igr3gkWX-N_l9W8itVBpqx-pChh-61atxGYkPc/edit
Ryan Kidd
6 months ago
I think that there should be more AI safety organizations to: harness the talent produced by AI safety field-building programs (MATS, ARENA, etc.); build an ecosystem of evals and auditing orgs; capture free energy for gov-funded and for-profit AI safety organizations with competent, aligned talent; and support a multitude of neglected research bets to aid potential paradigm shifts for AI safety. As an AI safety organization incubator, Catalyze seems like the most obvious solution.
As Co-Director at MATS, I have seen a lot of interest from scholars and alumni in founding AI safety organizations. However, most scholars do not have any entrepreneurial experience and have little access to suitable co-founders in their networks. I am excited about Catalyze's proposed co-founder pairing program and start-up founder curriculum.
I know Kay Kozaronek fairly well from his time in the MATS Program. I think that he has a good mix of engagement with AI safety technical research priorities, an entrepreneurial personality, and some experience in co-founding an AI safety startup (Cadenza Labs). I do not know Alexandra or Gábor quite as well, but they seem driven and bring diverse experience.
I think that the marginal value of my grant to Catalyze is very high at the moment. Catalyze are currently putting together funding proposals for their first incubator program and I suspect that their previous Lightspeed funding might run low before they receive confirmation from other funders.
Alexandra and Kay do not have significant experience in founding/growing organizations, and none of the core team seem to have significant experience with AI safety grantmaking or cause prioritization. However, I believe that Gábor brings significant entrepreneurial experience, and Jan-Willem and I, as advisory board members, bring significant additional experience in applicant selection. I don't see anyone else lining up to produce an AI safety org incubator, and I think Alexandra, Kay, and Gábor have a decent chance at succeeding. Regardless, I recommend that Catalyze recruit another advisory board member with significant AI safety grantmaking experience to aid in applicant/project selection.
It's possible that Catalyze's incubator program helps further projects that contribute disproportionately to AI capabilities advances. I recommend that Catalyze consider the value alignment of participants and the capabilities-alignment tradeoff of projects during selection and incubation. Additionally, it would be ideal if Catalyze sought an additional advisory board member with significant experience in evaluating dual-use AI safety research.
There might not be enough high-level AI safety research talent available to produce many viable AI safety research organizations right away. I recommend that Catalyze run an MVP incubator program to assess the quality of founders/projects, including funder and VC interest, before investing in a large program.
Alexandra said that $5k gives Catalyze one month of runway, so $15k gives them three months runway. I think that three months is more than sufficient time for Catalyze to receive funding from a larger donor and plan an MVP incubator program. I don't want Catalyze to fail because of short-term financial instability.
I am an unpaid advisor to Catalyze. I will not accept any money for this role.
Kay was a scholar in MATS, the program I co-lead. Additionally, I expect that many potential participants in Catalyze's incubator programs will be MATS alumni. Part of MATS' theory of change is to aid the creation of further AI safety organizations and funders may assess MATS' impact on the basis of alumni achievements.
Catalyze wants to hold their incubator program at LISA, an office that I co-founded and at which I remain a Board Member. However, I currently receive no income from LISA and, as LISA is a not-for-profit entity, I have no direct financial stake in its success. That said, I obviously want LISA to succeed and believe that a potential collaboration with Catalyze might be beneficial.
My donation represents my personal views and in no way constitutes an endorsement by MATS or LISA.
Ryan Kidd
8 months ago
Update update: Several more awesome mentors have come forward and we now are funding constrained again for Winter!
Ryan Kidd
8 months ago
Update: we don't appear to be funding constrained for Winter, but will continue accepting donations for our Summer 2024 Program!
Ryan Kidd
9 months ago
Developmental interpretability seems like a potentially promising and relatively underexplored research direction for exploring neural network generalization and inductive biases. Hopefully, this research can complement low-level or probe-based approaches for neural network interpretability and eventually help predict, explain, and steer dangerous AI capabilities such as learned optimization and deceptive alignment.
Jesse made a strong, positive impression on me as a scholar in the SERI MATS Winter 2022-23 Cohort; his research was impressive and he engaged well with criticism and other scholars' diverse research projects. His mentor, Evan Hubinger, endorsed his research at the time and obviously continues to do so, as indicated by his recent regrant. While Jesse is relatively young to steer a research team, he has strong endorsements and support from Dan Murfet, David Krueger, Evan Hubinger, and other researchers, and has displayed impressive entrepreneurship in launching Timaeus and organizing the SLT summits.
I recently met Dan Murfet at EAGxAustralia 2023 and was impressed by his research presentation skills, engagement with AI safety, and determination to build the first dedicated academic AI safety lab in Australia. Dan seems like a great research lead for the University of Melbourne lab, where much of this research will be based.
Australia has produced many top ML and AI safety researchers, but has so far lacked a dedicated AI safety organization to leverage local talent. I believe that we need more AI safety hubs, especially in academic institutions, and I see Timaeus (although remote) and the University of Melbourne as strong contenders.
Developmental interpretability seems like an ideal research vehicle to leverage underutilized physics and mathematics talent for AI safety. Jesse is a former physicist and Dan is a mathematician who previously specialized in algebraic geometry. In my experience as Co-Director of MATS, I have realized that many former physicists and mathematicians are deeply interested in AI safety, but lack a transitionary route to adapt their skills to the challenge.
Other funders (e.g., Open Phil, SFF) seem more reluctant (or at least slower) to fund this project than Manifund or Lightspeed, and Jesse/Dan told me that they would need more funds within a week if they were going to hire another RA. I believe that this $20k is a high-expected-value investment in reducing the stress associated with founding a potentially promising new AI safety organization, and it will allow Jesse/Dan to produce more exploratory research early to ascertain the value of SLT for AI safety.
I have read several of Jesse's and Dan's posts about SLT and Dev Interp and watched several of their talks, but still feel that I don't entirely grasp the research direction. I could spend further time on this, but I feel more than confident enough to recommend $20k.
Jesse is relatively young to run a research organization and Dan is relatively new to AI safety research; however, they seem more than capable for my level of risk tolerance with $20k, even with my current $50k pot.
The University of Melbourne may not be an ideal (or supportive) home for this research team; however, Timaeus already plans to be somewhat remote and several fiscal sponsors (e.g., Rethink Priorities Special Projects, BERI, Ashgro) would likely be willing to support their researchers.
I chose to donate $20k because Jesse said that a single paper would cost $40k (roughly 1 RA-year) and my budget is limited. I encourage further regrantors to join me and fund another half-paper!
Jesse was a scholar in the program I co-lead, but I do not believe that this constitutes a conflict of interest.
Ryan Kidd
11 months ago
@alenglander, when do you expect to hear back from the LTFF? Was the Nonlinear Network funding successful?
| For | Date | Type | Amount |
|---|---|---|---|
| Evaluating the Effectiveness of Unlearning Techniques | 4 days ago | project donation | 10000 |
| MATS Funding | 5 days ago | project donation | +2000 |
| MATS Funding | 28 days ago | project donation | +1000 |
| AI Safety Textbook | about 2 months ago | project donation | 39000 |
| MATS Funding | about 2 months ago | project donation | +400 |
| Manifund Bank | about 2 months ago | withdraw | 81040 |
| MATS Funding | 2 months ago | project donation | +1040 |
| MATS Funding | 2 months ago | project donation | +80000 |
| Manifund Bank | 3 months ago | deposit | +250000 |
| AI Safety Research Organization Incubator - Pilot Program | 6 months ago | project donation | 15000 |
| Help Apart Expand Global AI Safety Research | 6 months ago | project donation | 5000 |
| Manifund Bank | 6 months ago | withdraw | 190178 |
| AI Policy work @ IAPS | 6 months ago | project donation | 5000 |
| Cadenza Labs: AI Safety research group working on own interpretability agenda | 6 months ago | project donation | 5000 |
| MATS Funding | 6 months ago | project donation | +14000 |
| MATS Funding | 6 months ago | project donation | +134 |
| MATS Funding | 6 months ago | project donation | +1211 |
| MATS Funding | 6 months ago | project donation | +17533 |
| MATS Funding | 7 months ago | project donation | +6000 |
| MATS Funding | 7 months ago | project donation | +500 |
| MATS Funding | 7 months ago | project donation | +150000 |
| MATS Funding | 7 months ago | project donation | +300 |
| MATS Funding | 7 months ago | project donation | +500 |
| Scoping Developmental Interpretability | 9 months ago | project donation | 20000 |
| Manifund Bank | 11 months ago | deposit | +50000 |