AI safety takes

Share this post

User's avatar
AI safety takes
November 2022 safety news: Mode collapse in InstructGPT, Adversarial Go
Copy link
Facebook
Email
Notes
More

November 2022 safety news: Mode collapse in…

Daniel Paleka
Dec 1, 2022

Share this post

User's avatar
AI safety takes
November 2022 safety news: Mode collapse in InstructGPT, Adversarial Go
Copy link
Facebook
Email
Notes
More

Better version of the monthly Twitter thread.

Read →
Comments
User's avatar
© 2025 Daniel Paleka
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share

Copy link
Facebook
Email
Notes
More