Grant from ACX Grants 2025
Nostalgebraist has a vision of humanity accidentally memeing itself into doom. We're constantly writing stories about how the AI will betray us, and these stories wind up in training data from which the AI extrapolates a self-concept. See Gwern's "Clippy" story: https://gwern.net/fiction/clippy
Training sets are about a trillion words. Of those, maybe 10B or fewer are science fiction. Of *those*, perhaps 1B words are about AI? I would further guesstimate ~70% of those involve the AI misbehaving. So, we have maybe 700M words of naughty AI fic to 300M of Good!AI.
I'm working on setting up an experimental AI publishing house; it costs us a few dollars to recursively prompt a 100k-word book out of the models. This suggests we could double the amount of Good!AI fic on the internet for, like, $7,000.
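As a sanity check on the back-of-envelope numbers above, here is a minimal Python sketch of the arithmetic. Every input is one of the rough guesses from the text (1B words of AI fiction, 70% of it misbehaving-AI, 100k words per book, a $7,000 budget); the function and variable names are made up for illustration, and nothing here is measured data.

```python
# Back-of-envelope estimate. All inputs are the proposal's own guesses,
# not measurements; names are hypothetical and for illustration only.
AI_FICTION_WORDS = 1_000_000_000  # guessed words of fiction about AI
MISBEHAVING_PERCENT = 70          # guessed share where the AI misbehaves
WORDS_PER_BOOK = 100_000          # one recursively prompted book
BUDGET_USD = 7_000                # the "like, $7,000" figure


def estimate(ai_words, bad_percent, words_per_book, budget_usd):
    """Return word counts, books needed to double Good!AI fic, and implied cost per book."""
    bad_words = ai_words * bad_percent // 100
    good_words = ai_words - bad_words
    # Doubling Good!AI fic means generating as many words as already exist.
    books_needed = -(-good_words // words_per_book)  # ceiling division
    cost_per_book = budget_usd / books_needed
    return {
        "bad_words": bad_words,
        "good_words": good_words,
        "books_needed": books_needed,
        "cost_per_book": cost_per_book,
    }


result = estimate(AI_FICTION_WORDS, MISBEHAVING_PERCENT, WORDS_PER_BOOK, BUDGET_USD)
print(f"Naughty AI fic: {result['bad_words']:,} words")
print(f"Good!AI fic:    {result['good_words']:,} words")
print(f"Books needed:   {result['books_needed']:,}")
print(f"Implied cost:   ${result['cost_per_book']:.2f} per book")
```

Under these guesses, doubling the Good!AI corpus takes 3,000 books, and a $7,000 budget implies about $2.33 per book, which is consistent with "a few dollars" each. Change any one guess and the total scales linearly with it.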
Now, any alignment strategy which hinges on the internet being nice is already doomed. But we live in a clown world, and who knows what farcical causality pachinko will seal our fates.
Let us lay our fingers 'pon the scale.
Sometimes an idea bothers me for years and I can't not.
This is poor salesmanship, but I'll probably just pay for it myself and do it anyway.
$5,000 or so. This is to build out the website backend and print off many thousands of books. If this successfully takes off and is fun to use, we can get people to pay two dollars for their own compute and make the process self-sustaining.