Artwork
iconShare
 
Manage episode 522231585 series 3647399
Content provided by Center for AI Safety. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Center for AI Safety or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://staging.podcastplayer.com/legal.

Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required..

Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required.

In this edition we discuss the new AI Dashboard, recent frontier models from Google and Anthropic, and a revived push to preempt state AI regulations.

Listen to the AI Safety Newsletter for free on Spotify or Apple Podcasts.

CAIS Releases the AI Dashboard for Frontier Performance

CAIS launched its AI Dashboard, which evaluates frontier AI systems on capability and safety benchmarks. The dashboard also tracks the industry's overall progression toward broader milestones such as AGI, automation of remote labor, and full self-driving.

How the dashboard works. The AI Dashboard features three leaderboards—one for text, one for vision, and one for risks—where frontier models are ranked according to their average score across a battery of benchmarks. Because CAIS evaluates models directly across a wide range of tasks, the dashboard provides apples-to-apples comparisons of how different frontier models perform on the same set of evaluations and safety-relevant behaviors.

Ranking frontier models for [...]

---

Outline:

(00:33) CAIS Releases the AI Dashboard for Frontier Performance

(04:05) Politicians Revive Push for Moratorium on State AI Laws

(06:39) Gemini 3 Pro and Claude Opus 4.5 Arrive

(09:17) In Other News

(09:20) Government

(10:15) Industry

(11:03) Civil Society

(12:00) Discussion about this post

---

First published:
December 2nd, 2025

Source:
https://newsletter.safe.ai/p/ai-safety-newsletter-66-aisn-66-evaluating

---

Want more? Check out our ML Safety Newsletter for technical safety research.

Narrated by TYPE III AUDIO.

---

Images from the article:

Graph showing AI model performance over time titled
Table showing AI model performance scores across reasoning, coding, and gaming benchmarks.
Bar chart titled

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

  continue reading

72 episodes