AI safety takes
Subscribe
Sign in
Share this post
AI safety takes
September/October 2023 safety news: Sparse autoencoders, A is B is not B is A, Image hijacks
Copy link
Facebook
Email
Notes
More
September/October 2023 safety news: Sparse…
Daniel Paleka
Oct 17, 2023
3
Share this post
AI safety takes
September/October 2023 safety news: Sparse autoencoders, A is B is not B is A, Image hijacks
Copy link
Facebook
Email
Notes
More
Better version of the Twitter thread.
Read →
Comments
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts
Share this post
September/October 2023 safety news: Sparse…
Share this post
Better version of the Twitter thread.