Archive - Joe Carlsmith's Substack

Controlling the options AIs can pursue

On blocking paths to power, and on making deals.

Sep 29 •

Video and transcript of talk on giving AIs safe motivations

From a talk at UT Austin in September 2025.

Sep 22 •

August 2025

Giving AIs safe motivations

A four-part picture.

Aug 18 •

July 2025

Video and transcript of talk on "Can goodness compete?"

From a public talk on long-term equilibria post-AGI, given at Mox in SF in July 2025.

Jul 17 •

May 2025

Video and transcript of talk on AI welfare

An overview of my take on AI welfare as of May 2025, from a talk at Anthropic.

May 22 •

The stakes of AI moral status

On seeing and not seeing souls.

May 21

April 2025

Video and transcript of talk on automating alignment research

From a talk at Anthropic in April 2025

Apr 30 •

Can we safely automate alignment research?

It's really important; we have a real shot; there are a lot of ways we can fail.

Apr 30 •

March 2025

AI for AI safety

We should try extremely hard to use AI labor to help address the alignment problem.

Mar 14 •

Paths and waystations in AI safety

On the structure of the path to safe superintelligence, and some possible milestones along the way.

Mar 11 •

February 2025

When should we worry about AI power-seeking?

Examining the conditions required for rogue AI behavior.

Feb 19 •

What is it to solve the alignment problem?

Also: to avoid it? Handle it? Solve it forever? Solve it completely?

Feb 13 •

#nojs-banner { position: fixed; bottom: 0; left: 0; padding: 16px 16px 16px 32px; width: 100%; box-sizing: border-box; background: red; color: white; font-family: -apple-system, "Segoe UI", Roboto, Helvetica, Arial, sans-serif, "Apple Color Emoji", "Segoe UI Emoji", "Segoe UI Symbol"; font-size: 13px; line-height: 13px; } #nojs-banner a { color: inherit; text-decoration: underline; } This site requires JavaScript to run correctly. Please turn on JavaScript or unblock scripts