Search a title or topic

Over 20 million podcasts, powered by 

Player FM logo
Artwork

Content provided by Craig S. Smith. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Craig S. Smith or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://staging.podcastplayer.com/legal.
Player FM - Podcast App
Go offline with the Player FM app!

#239 Tuhin Srivatsa: How Baseten is Disrupting AI Deployment & Scaling in 2025

46:17
 
Share
 

Manage episode 468574200 series 2455219
Content provided by Craig S. Smith. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Craig S. Smith or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://staging.podcastplayer.com/legal.

This episode is sponsored by Thuma.

Thuma is a modern design company that specializes in timeless home essentials that are mindfully made with premium materials and intentional details.

To get $100 towards your first bed purchase, go to http://thuma.co/eyeonai

—————————————————————————————————————————

AI deployment is broken—can it be fixed? In this episode, Tuhin Srivatsa, CEO & Co-Founder of Baseten, reveals how his company is DISRUPTING AI infrastructure, making it easier, faster, and more cost-effective to deploy and scale AI models in production.

As enterprises increasingly turn to open-source AI models and grapple with the high costs and complexity of scaling, Baseten offers a game-changing solution that eliminates bottlenecks and simplifies the process. Discover how Baseten is taking on AWS SageMaker, OpenAI, and cloud-based AI deployment platforms to reshape the future of AI model deployment.

What You’ll Learn in This Episode:
  • Why AI deployment & scaling is one of the biggest challenges in 2025

  • How Baseten enables enterprises to run AI models faster & more efficiently

  • The shift from closed-source to open-source AI models—and why it matters

  • The hidden costs of AI inference & how to optimize for performance

  • Why most AI models fail in production and how to prevent it

  • The future of AI infrastructure: What comes next for scalable AI

Whether you’re a machine learning engineer, AI researcher, startup founder, or enterprise leader, this episode is packed with actionable insights to help you scale AI models without the headaches.

Don’t miss this conversation on the next era of AI deployment!

#AI #ArtificialIntelligence #MachineLearning #Baseten #AIDeployment #AIScaling #Inference #MLInfrastructure #TechPodcast

Stay Updated:

Craig Smith Twitter: https://twitter.com/craigss

Eye on A.I. Twitter: https://twitter.com/EyeOn_AI

—————————————————————————————————————————

(00:00) Tuhin Srivatsa’s Journey in AI & Baseten

(01:50) What is AI Infrastructure & Why It Matters

(03:30) How Baseten Optimizes AI Model Deployment

(05:19) Why Most AI Deployments Fail (And How to Fix It)

(09:17) The Future of Open-Source AI Models in Enterprise

(11:01) How Baseten Automates AI Scaling & Inference

(14:12) Why AI Developers Struggle with Cloud-Based AI Tools

(18:47) The Real Cost of AI Inference (And How to Reduce It)

(20:44) Why AI Scaling is the Biggest Challenge in 2025

(26:55) Can AI Run on Non-NVIDIA Chips? (The Hardware Debate)

(31:23) The Future of AI Model Deployment & Inference

(37:05) How AI Agents & Reasoning Models Are Changing the Game

(40:39) The Truth About AI Hype vs. Reality

(45:04) How to Get Started with Baseten

(45:48) The Future of AI Infrastructure

  continue reading

253 episodes

Artwork
iconShare
 
Manage episode 468574200 series 2455219
Content provided by Craig S. Smith. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Craig S. Smith or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://staging.podcastplayer.com/legal.

This episode is sponsored by Thuma.

Thuma is a modern design company that specializes in timeless home essentials that are mindfully made with premium materials and intentional details.

To get $100 towards your first bed purchase, go to http://thuma.co/eyeonai

—————————————————————————————————————————

AI deployment is broken—can it be fixed? In this episode, Tuhin Srivatsa, CEO & Co-Founder of Baseten, reveals how his company is DISRUPTING AI infrastructure, making it easier, faster, and more cost-effective to deploy and scale AI models in production.

As enterprises increasingly turn to open-source AI models and grapple with the high costs and complexity of scaling, Baseten offers a game-changing solution that eliminates bottlenecks and simplifies the process. Discover how Baseten is taking on AWS SageMaker, OpenAI, and cloud-based AI deployment platforms to reshape the future of AI model deployment.

What You’ll Learn in This Episode:
  • Why AI deployment & scaling is one of the biggest challenges in 2025

  • How Baseten enables enterprises to run AI models faster & more efficiently

  • The shift from closed-source to open-source AI models—and why it matters

  • The hidden costs of AI inference & how to optimize for performance

  • Why most AI models fail in production and how to prevent it

  • The future of AI infrastructure: What comes next for scalable AI

Whether you’re a machine learning engineer, AI researcher, startup founder, or enterprise leader, this episode is packed with actionable insights to help you scale AI models without the headaches.

Don’t miss this conversation on the next era of AI deployment!

#AI #ArtificialIntelligence #MachineLearning #Baseten #AIDeployment #AIScaling #Inference #MLInfrastructure #TechPodcast

Stay Updated:

Craig Smith Twitter: https://twitter.com/craigss

Eye on A.I. Twitter: https://twitter.com/EyeOn_AI

—————————————————————————————————————————

(00:00) Tuhin Srivatsa’s Journey in AI & Baseten

(01:50) What is AI Infrastructure & Why It Matters

(03:30) How Baseten Optimizes AI Model Deployment

(05:19) Why Most AI Deployments Fail (And How to Fix It)

(09:17) The Future of Open-Source AI Models in Enterprise

(11:01) How Baseten Automates AI Scaling & Inference

(14:12) Why AI Developers Struggle with Cloud-Based AI Tools

(18:47) The Real Cost of AI Inference (And How to Reduce It)

(20:44) Why AI Scaling is the Biggest Challenge in 2025

(26:55) Can AI Run on Non-NVIDIA Chips? (The Hardware Debate)

(31:23) The Future of AI Model Deployment & Inference

(37:05) How AI Agents & Reasoning Models Are Changing the Game

(40:39) The Truth About AI Hype vs. Reality

(45:04) How to Get Started with Baseten

(45:48) The Future of AI Infrastructure

  continue reading

253 episodes

All episodes

×
 
Loading …

Welcome to Player FM!

Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.

 

Listen to this show while you explore
Play