181 | LW - If You Can Climb Up, You Can Climb Down by jefftk 2024-07-31 |
182 | LW - Open Source Automated Interpretability for Sparse Autoencoder Features by kh4dien 2024-07-31 |
183 | LW - Twitter thread on AI safety evals by Richard Ngo 2024-07-31 |
184 | LW - RTFB: California's AB 3211 by Zvi 2024-07-30 |
185 | LW - Self-Other Overlap: A Neglected Approach to AI Alignment by Marc Carauleanu 2024-07-30 |
186 | LW - Understanding Positional Features in Layer 0 SAEs by bilalchughtai 2024-07-30 |
187 | LW - This is already your second chance by Malmesbury 2024-07-28 |
188 | LW - Unlocking Solutions by James Stephen Brown 2024-07-28 |
189 | LW - Re: Anthropic's suggested SB-1047 amendments by RobertM 2024-07-27 |
190 | LW - Safety consultations for AI lab employees by Zach Stein-Perlman 2024-07-27 |