AI safety takes

Share this post

User's avatar
AI safety takes
November/December 2023 safety news: Weak-to-strong generalization, Superhuman concepts, Google-proof benchmark
Copy link
Facebook
Email
Notes
More

November/December 2023 safety news…

Daniel Paleka
Dec 27, 2023
5

Share this post

User's avatar
AI safety takes
November/December 2023 safety news: Weak-to-strong generalization, Superhuman concepts, Google-proof benchmark
Copy link
Facebook
Email
Notes
More

Better version of the Twitter newsletter.

Read →
Comments
User's avatar
© 2025 Daniel Paleka
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share

Copy link
Facebook
Email
Notes
More