Better version of the Twitter newsletter. Bridging the Human-AI Knowledge Gap: Concept Discovery and Transfer in AlphaZero Superhuman AI will use concepts and abstractions that are not part of human knowledge. To supervise those AIs, we need to understand those concepts.
November/December 2023 safety news: Weak-to-strong generalization, Superhuman concepts, Google-proof benchmark
November/December 2023 safety news…
November/December 2023 safety news: Weak-to-strong generalization, Superhuman concepts, Google-proof benchmark
Better version of the Twitter newsletter. Bridging the Human-AI Knowledge Gap: Concept Discovery and Transfer in AlphaZero Superhuman AI will use concepts and abstractions that are not part of human knowledge. To supervise those AIs, we need to understand those concepts.