AI safety takes
Subscribe
Sign in
Share this discussion
September/October 2023 safety news: Sparse autoencoders, A is B is not B is A, Image hijacks
newsletter.danielpaleka.com
Copy link
Facebook
Email
Note
Other
September/October 2023 safety news: Sparse…
Daniel Paleka
Oct 17, 2023
3
Share this post
September/October 2023 safety news: Sparse autoencoders, A is B is not B is A, Image hijacks
newsletter.danielpaleka.com
Copy link
Facebook
Email
Note
Other
Better version of the Twitter thread.
Read →
Comments
Share
Share
Copy link
Facebook
Email
Note
Other
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts
September/October 2023 safety news: Sparse autoencoders, A is B is not B is A, Image hijacks
September/October 2023 safety news: Sparse…
September/October 2023 safety news: Sparse autoencoders, A is B is not B is A, Image hijacks
Better version of the Twitter thread.