DeepSeek Minisode

muckrAIkers

Content provided by Jacob Haimes and Igor Krawczuk. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Jacob Haimes and Igor Krawczuk or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://staging.podcastplayer.com/legal.

DeepSeek R1 has taken the world by storm, causing a stock market crash and prompting further calls for export controls within the US. Since this story is still very much in development, with follow-up investigations and calls for governance being released almost daily, we thought it best to hold off for a little while longer to be able to tell the whole story. Nonetheless, it's a big story, so we provide a brief overview of all that's out there so far.

  • (00:00) - Recording date
  • (00:04) - Intro
  • (00:37) - DeepSeek drop and reactions
  • (04:27) - Export controls
  • (08:05) - Skepticism and uncertainty
  • (14:12) - Outro


Links
  • DeepSeek website
  • DeepSeek paper
  • Reuters article - What is DeepSeek and why is it disrupting the AI sector?

Fallout coverage

  • The Verge article - OpenAI has evidence that its models helped train China’s DeepSeek
  • The Signal article - Nvidia loses nearly $600 billion in DeepSeek crash
  • CNN article - US lawmakers want to ban DeepSeek from government devices
  • Fortune article - Meta is reportedly scrambling ‘war rooms’ of engineers to figure out how DeepSeek’s AI is beating everyone else at a fraction of the price
  • Dario Amodei's blogpost - On DeepSeek and Export Controls
  • SemiAnalysis article - DeepSeek Debates
  • Ars Technica article - Microsoft now hosts AI model accused of copying OpenAI data
  • Wiz Blogpost - Wiz Research Uncovers Exposed DeepSeek Database Leaking Sensitive Information, Including Chat History

Investigations into "reasoning"

  • Blogpost - There May Not be Aha Moment in R1-Zero-like Training — A Pilot Study
  • Preprint - s1: Simple test-time scaling
  • Preprint - LIMO: Less is More for Reasoning
  • Blogpost - Reasoning Reflections
  • Preprint - Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH
