Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required.
In this edition, we discuss the new AI Dashboard, recent frontier models from Google and Anthropic, and a revived push to preempt state AI regulations.
Listen to the AI Safety Newsletter for free on Spotify or Apple Podcasts.
CAIS Releases the AI Dashboard for Frontier Performance
CAIS launched its AI Dashboard, which evaluates frontier AI systems on capability and safety benchmarks. The dashboard also tracks the industry's overall progression toward broader milestones such as AGI, automation of remote labor, and full self-driving.
How the dashboard works. The AI Dashboard features three leaderboards—one for text, one for vision, and one for risks—where frontier models are ranked according to their average score across a battery of benchmarks. Because CAIS evaluates models directly across a wide range of tasks, the dashboard provides apples-to-apples comparisons of how different frontier models perform on the same set of evaluations and safety-relevant behaviors.
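As a rough illustration of ranking by average score, here is a minimal Python sketch. It is not CAIS's actual method: the model names, benchmark names, and scores are hypothetical placeholders, and it assumes an unweighted mean where higher is better, which the newsletter does not specify.

```python
from statistics import mean

# Hypothetical scores (0-100) for placeholder models on placeholder benchmarks.
# None of these names or numbers come from the AI Dashboard.
scores = {
    "model_a": {"bench_1": 80.0, "bench_2": 60.0, "bench_3": 70.0},
    "model_b": {"bench_1": 90.0, "bench_2": 50.0, "bench_3": 85.0},
    "model_c": {"bench_1": 65.0, "bench_2": 75.0, "bench_3": 55.0},
}


def rank_by_average(scores):
    """Return (model, average) pairs sorted from highest to lowest average.

    Assumption: every benchmark counts equally (unweighted mean).
    """
    averages = {model: mean(results.values()) for model, results in scores.items()}
    return sorted(averages.items(), key=lambda item: item[1], reverse=True)


leaderboard = rank_by_average(scores)
for position, (model, average) in enumerate(leaderboard, start=1):
    print(f"{position}. {model}: {average:.1f}")
```

Because every model is scored on the same set of benchmarks, the averages are directly comparable, which is the apples-to-apples property described above.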
Ranking frontier models for [...]
---
Outline:
(00:33) CAIS Releases the AI Dashboard for Frontier Performance
(04:05) Politicians Revive Push for Moratorium on State AI Laws
(06:39) Gemini 3 Pro and Claude Opus 4.5 Arrive
(09:17) In Other News
(09:20) Government
(10:15) Industry
(11:03) Civil Society
(12:00) Discussion about this post
---
First published:
December 2nd, 2025
Source:
https://newsletter.safe.ai/p/ai-safety-newsletter-66-aisn-66-evaluating
---
Want more? Check out our ML Safety Newsletter for technical safety research.
Narrated by TYPE III AUDIO.