Wes Roth | Civilisational risk and strategy | Spotlight | Released: 3 Apr 2025
OpenAI's Autonomous AI Research Benchmark
Why this matters
Keeps the discussion of evals grounded in a currently available, benchmark-focused source.
Summary
Focused coverage of autonomous research benchmarks and what they imply for capability claims.
Editor note
Refreshed to a live source from Wes Roth's channel for reliable on-site playback.
ai-safety, wes-roth, evals