Joe Carlsmith's Substack
New report: "Scheming AIs: Will AIs fake alignment during training in order to get power?"
Joe Carlsmith
Nov 15, 2023
I examine the probability of a behavior sometimes called "deceptive alignment."