Search a title or topic

Over 20 million podcasts, powered by 

Player FM logo

Llmops Podcasts

show episodes
 
In 2023, ChatGPT put AI on everyone’s agenda. In 2024, the challenge will be turning those agendas into reality. In Generative AI in the Real World, Ben Lorica interviews leaders who are building with AI. Learn from their experience to help put AI to work in your enterprise.
  continue reading
 
Loading …
show series
 
MLOps is dead. Well, not really, but for many the job is evolving into LLMOps. In this episode, Abide AI founder and LLMOps author Abi Aryan joins Ben to discuss what LLMOps is and why it’s needed, particularly for agentic AI systems. Listen in to hear why LLMOps requires a new way of thinking about observability, why we should spend more time unde…
  continue reading
 
In this episode, Laurence Moroney, director of AI at Arm, joins Ben Lorica to chat about the state of deep learning frameworks—and why you may be better off thinking a step higher, on the solution level. Listen in for Laurence’s thoughts about posttraining; the evolution of on-device AI (and how tools like ExecuTorch and LiteRT are helping make it …
  continue reading
 
In this episode, Laurence Moroney, director of AI at Arm, joins Ben Lorica to chat about the state of deep learning frameworks—and why you may be better off thinking a step higher, on the solution level. Listen in for Laurence’s thoughts about posttraining; the evolution of on-device AI (and how tools like ExecuTorch and LiteRT […]…
  continue reading
 
In this episode, Ben Lorica and Chris Butler, director of product operations for GitHub's Synapse team, chat about the experimentation Chris is doing to incorporate generative AI into the product development process—particularly with the goal of reducing toil for cross-functional teams. It isn’t just automating busywork (although there’s some of th…
  continue reading
 
In this episode, Ben Lorica and Chris Butler, director of product operations for GitHub’s Synapse team, chat about the experimentation Chris is doing to incorporate generative AI into the product development process—particularly with the goal of reducing toil for cross-functional teams. It isn’t just automating busywork (although there’s some of th…
  continue reading
 
In this episode, Ben Lorica and Drew Breunig, a strategist at the Overture Maps Foundation, talk all things context engineering: what’s working, where things are breaking down, and what comes next. Listen in to hear why huge context windows aren’t solving the problems we hoped they might, why companies shouldn’t discount evals and testing, and why …
  continue reading
 
In this episode, Ben Lorica and Drew Breunig, a strategist at the Overture Maps Foundation, talk all things context engineering: what’s working, where things are breaking down, and what comes next. Listen in to hear why huge context windows aren’t solving the problems we hoped they might, why companies shouldn’t discount evals and testing, and […]…
  continue reading
 
In this episode, Ben Lorica and Anthropic interpretability researcher Emmanuel Ameisen get into the work Emmanuel’s team has been doing to better understand how LLMs like Claude work. Listen in to find out what they’ve uncovered by taking a microscopic look at how LLMs function—and just how far the analogy to the human brain holds.…
  continue reading
 
In this episode, Ben Lorica and Anthropic interpretability researcher Emmanuel Ameisen get into the work Emmanuel’s team has been doing to better understand how LLMs like Claude work. Listen in to find out what they’ve uncovered by taking a microscopic look at how LLMs function—and just how far the analogy to the human brain holds. […]…
  continue reading
 
Everyone is talking about agents: single agents and, increasingly, multi-agent systems. What kind of applications will we build with agents, and how will we build with them? How will agents communicate with each other effectively? Why do we need a protocol like A2A to specify how they communicate? Join Ben Lorica as he talks with Heiko Hotz and Sok…
  continue reading
 
In this episode, Ben Lorica and AI Engineer Faye Zhang talk about discoverability: how to use AI to build search and recommendation engines that actually find what you want. Listen in to learn how AI goes way beyond simple collaborative filtering—pulling in many different kinds of data and metadata, including images and voice, to get a much better …
  continue reading
 
Join Luke Wroblewski and Ben Lorica as they talk about the future of software development. What happens when we have databases that are designed to interact with agents and language models rather than humans? We’re starting to see what that world will look like. It’s an exciting time to be a software developer.…
  continue reading
 
Jay Alammar, director and Engineering Fellow at Cohere, joins Ben Lorica to talk about building AI applications for the enterprise, using RAG effectively, and the evolution of RAG into agents. Listen in to find out what kinds of metadata you need when you’re onboarding a new model or agent; discover how an emphasis on evaluation helps an organizati…
  continue reading
 
Phillip Carter, formerly of Honeycomb, and Ben Lorica talk about observability and AI—what observability means, how generative AI causes problems for observability, and how generative AI can be used as a tool to help SREs analyze telemetry data. There’s tremendous potential because AI is great at finding patterns in massive datasets, but it’s still…
  continue reading
 
Key Argument Thesis: Using ELO for AI agent evaluation = measuring noise Problem: Wrong evaluators, wrong metrics, wrong assumptions Solution: Quantitative assessment frameworks The Comparison (00:00-02:00) Chess ELO FIDE arbiters: 120hr training Binary outcome: win/loss Test-retest: r=0.95 Cohen's κ=0.92 AI Agent ELO Random users: Google engineer?…
  continue reading
 
Audio is being added to AI everywhere: both in multimodal models that can understand and generate audio and in applications that use audio for input. Now that we can work with spoken language, what does that mean for the applications that we can develop? How do we think about audio interfaces—how will people use them, and what will they want to do?…
  continue reading
 
AI coding agents face the same fundamental limitation as parallel computing: Amdahl's Law. Just as 10 cooks can't make soup 10x faster, 10 AI agents can't code 10x faster due to inherent sequential bottlenecks. 📚 Key Concepts The Soup Analogy Multiple cooks can divide tasks (prep, boiling water, etc.) But certain steps MUST be sequential (can't sti…
  continue reading
 
How do you teach kids to use and build with AI? That’s what Stefania Druga works on. It’s important to be sensitive to their creativity, sense of fun, and desire to learn. When designing for kids, it’s important to design with them, not just for them. That’s a lesson that has important implications for adults, too. Join Stefania Druga and Ben Loric…
  continue reading
 
Join our host Ben Lorica and Douwe Kiela, cofounder of Contextual AI and author of the first paper on RAG, to find out why RAG remains as relevant as ever. Regardless of what you call it, retrieval is at the heart of generative AI. Find out why—and how to build effective RAG-based systems. Points of Interest 0:25: Today’s topic is RAG. With frontie…
  continue reading
 
Join Danielle Belgrave and Ben Lorica for a discussion of AI in healthcare. Danielle is VP of AI and machine learning at GSK (formerly GlaxoSmithKline). She and Ben discuss using AI and machine learning to get better diagnoses that reflect the differences between patients. Listen in to learn about the challenges of working with health data—a field …
  continue reading
 
Ben Lorica and Gabriela de Queiroz, director of AI at Microsoft, talk about startups: specifically, AI startups. How do you get noticed? How do you generate real traction? What are startups doing with agents and with protocols like MCP and A2A? And which security issues should startups watch for, especially if they’re using open weights models? Poi…
  continue reading
 
Join Steve Wilson and Ben Lorica for a discussion of AI security. We all know that AI brings new vulnerabilities into the software landscape. Steve and Ben talk about what makes AI different, what the big risks are, and how you can use AI safely. Find out how agents introduce their own vulnerabilities, and learn about resources such as OWASP that c…
  continue reading
 
Businesses have a lot of data—but most of that data is unstructured textual data: reports, catalogs, emails, notes, and much more. Without structure, business analysts can’t make sense of the data; there is value in the data, but it can’t be put to use. AI can be a tool for finding and extracting the structure that’s hidden in textual data. In this…
  continue reading
 
Ever since Andrej Karpathy first tweeted it, “vibe coding” has been on every software developer’s mind. Join Ben Lorica and Steve Yegge to find out what vibe coding means, especially in a professional context. Going beyond the current memes, what will the future of software development look like when we have multiple agents? And how do you prepare …
  continue reading
 
In this edition of Generative AI in the Real World, Ben Lorica and Rajeshwari Ganesan talk about how to put generative AI in closer touch with human needs and requirements. AI isn’t all about building bigger models and benchmarks. To use it effectively, we need better interfaces; we need contexts that support groups rather than individuals; we need…
  continue reading
 
In this episode, Ben Lorica and Hamel Husain talk about how to take the next steps with artificial intelligence. Developers don’t need to build their own models—but they do need basic data skills. It’s important to look at your data, to discover your model’s weaknesses, and to use that information to develop test suites and evals that show whether …
  continue reading
 
Join Shelby Heinecke, senior research manager at Salesforce, and Ben Lorica as they talk about agents, AI models that can take action on behalf of their users. Are they the future—or at least the hot topic for the coming year? Where are we with smaller models? And what do we need to improve the agent stack? How do you evaluate the performance of mo…
  continue reading
 
How do we measure skills in an age of AI? That question has an effect on everything from hiring to productive teamwork. Join Kian Katanforoosh, founder and CEO of Workera, and Ben Lorica for a discussion of how we can use AI to assess skills more effectively. How do we get beyond pass/fail exams to true measures of a person’s ability? Points of Int…
  continue reading
 
Chloé Messdaghi and Ben Lorica discuss AI security—a subject of increasing importance as AI-driven applications roll out into the real world. There’s a knowledge gap: Security workers don’t understand AI, and AI developers don’t understand security. It’s important to be aware of all the resources that are available. Make sure to bring everyone toge…
  continue reading
 
Join Ben Lorica and Tom Smoker for a discussion of GraphRAG, one of the hottest topics of the last few months. GraphRAG goes a step beyond RAG to make the output of language models more consistent, accurate, and explainable. But what is a graph? A graph is a way of structuring data. In the end, it’s the structure that’s important, along with the wo…
  continue reading
 
Robert Nishihara is one of the creators of Ray and cofounder of Anyscale, a platform for high-performance distributed data analysis and artificial intelligence. Ben Lorica and Robert discuss the need for data for the next generation of AI, which will be multimodal. What kinds of data will we need to develop models for video and multimodal data? And…
  continue reading
 
In this episode, Ben Lorica talks with Claire Vo, chief product officer at Launch Darkly and founder of ChatPRD. AI gives us a new set of tools that make everyone more productive and efficient. Those tools will allow more experimentation; they will allow more people to participate in product development; and they will create new opportunities for s…
  continue reading
 
Join us for a conversation between Ben Lorica and Matt Welsh, cofounder of Fixie.ai, former engineer at Apple and Google, and one of Mark Zuckerberg’s professors at Harvard. Learn how AI is changing computing. Whether it’s in C or a human language, programming is telling a computer what you want it to do—but AI opens up new classes of things that w…
  continue reading
 
What can AI do to improve healthcare? Kingsley Ndoh, founder of Hurone AI, talks with Ben Lorica about how Hurone is making cancer care more effective for people who are underserved by the medical system. He discusses how AI can streamline the medical process, both helping doctors to treat patients more effectively and making clinical trials more d…
  continue reading
 
Rikin Gandhi, CTO of Digital Green, talks with Ben Lorica about using generative AI to help farmers in developing countries become more productive. Farmer.Chat integrates information from training videos, sources of weather and crop information, and other data sources in a multimodal app that farmers can use in real-time. Points of Interest 0:45: D…
  continue reading
 
Timothy Persons of PricewaterhouseCoopers (PwC) talks with Ben Lorica about adoption of AI in the enterprise. They discuss the challenges enterprises experience, including the need to change corporate culture. To succeed, it’s important to focus on solving well-defined problems rather than just doing something cool with AI. Good data strategies and…
  continue reading
 
Alfred Spector has been a leader in AI and machine learning at Google, IBM, and Two Sigma. He is now a visiting scholar at MIT, an advisor at Blackstone, and coauthor of the text book Data Science in Context. Alfred talks with Ben Lorica about what people developing with AI need to be successful. Succeeding with AI is about more than just a model. …
  continue reading
 
Everyone is talking about agents: single agents and, increasingly, multi-agent systems. What kind of applications will we build with agents, and how will we build with them? How will agents communicate with each other effectively? Why do we need a protocol like A2A to specify how they communicate? Join Ben Lorica as he talks with […]…
  continue reading
 
Andrew Ng is one of the pioneers of modern AI. He was Google Brain’s founding technical lead, Coursera’s founder, Baidu’s Chief Scientist, DeepLearning.ai’s founder, a Professor at Stanford—and much more. Andrew talks with Ben Lorica about scaling AI, agents, the future of open source AI, and openness among AI researchers. Have you experienced an “…
  continue reading
 
Gwendolyn Stripling, author of Low-Code AI, talks about the democratization of AI, the primacy of data, the future of data science, and the coming of agents. It’s easy to think that AI is all about algorithms and models but it’s not; it’s really about understanding the business use case and the data that can be applied to that use case. We’re only …
  continue reading
 
Justin Norman, author of Product Management for AI and co-founder of Vera, a startup focused on security for generative AI, talks with Ben Lorica about how product management has changed since Generative AI came on the scene. He discusses the issues retrieval-augmented generation (RAG) raises for product management; how reliability has become part …
  continue reading
 
Pete Warden, founder of Useful Sensors and co-author of TinyML, discusses use cases for artificial intelligence that we rarely think about: how can you run AI on very small systems? How can you put AI on consumer devices in ways that are actually useful and not just buzzword-compliant? AI doesn’t have to rely on massive GPU farms. Pete talks about …
  continue reading
 
O’Reilly’s Generative AI in the Enterprise survey reported that people have trouble coming up with appropriate enterprise use cases for AI. Why is it hard to come up with appropriate use cases? Chip Huyen, cofounder of Claypot AI and author of Designing Machine Learning Systems, talks about why many companies have trouble coming up with appropriate…
  continue reading
 
Jay Alammar, director and Engineering Fellow at Cohere, joins Ben Lorica to talk about building AI applications for the enterprise, using RAG effectively, and the evolution of RAG into agents. Listen in to find out what kinds of metadata you need when you’re onboarding a new model or agent; discover how an emphasis on evaluation […]…
  continue reading
 
Phillip Carter, formerly of Honeycomb, and Ben Lorica talk about observability and AI—what observability means, how generative AI causes problems for observability, and how generative AI can be used as a tool to help SREs analyze telemetry data. There’s tremendous potential because AI is great at finding patterns in massive datasets, but it’s still…
  continue reading
 
Audio is being added to AI everywhere: both in multimodal models that can understand and generate audio and in applications that use audio for input. Now that we can work with spoken language, what does that mean for the applications that we can develop? How do we think about audio interfaces—how will people use them, […]…
  continue reading
 
Join Danielle Belgrave and Ben Lorica for a discussion of AI in healthcare. Danielle is VP of AI and machine learning at GSK (formerly GlaxoSmithKline). She and Ben discuss using AI and machine learning to get better diagnoses that reflect the differences between patients. Listen in to learn about the challenges of working with health […]…
  continue reading
 
The plastic shamans of OpenAI 🔥 Hot Course Offers:- 🤖 Master GenAI Engineering - Build Production AI Systems- 🦀 Learn Professional Rust - Industry-Grade Development- 📊 AWS AI & Analytics - Scale Your ML in Cloud- ⚡ Production GenAI on AWS - Deploy at Enterprise Scale- 🛠️ Rust DevOps Mastery - Automate Everything🚀 Level Up Your Career:- 💼 Production…
  continue reading
 
Dangerous Dilettantes vs. Toyota Way Engineering Core Thesis The influx of AI-powered automation tools creates dangerous dilettantes - practitioners who know just enough to be harmful. The Toyota Production System (TPS) principles provide a battle-tested framework for integrating automation while maintaining engineering discipline. Historical Conte…
  continue reading
 
Loading …
Copyright 2025 | Privacy Policy | Terms of Service | | Copyright
Listen to this show while you explore
Play