LLM Evaluation Podcasts

All Things LLM is your go-to podcast for demystifying Large Language Models! We break down their core concepts—like tokens, embeddings, and the self-attention that powers GPT-4 and Llama. Learn how LLMs are built, trained, and fine-tuned (SFT, RLHF, PEFT) on massive datasets. Discover real-world use cases in healthcare, finance, chatbots, code, RAG, and more. We explore the LLM ecosystem, covering open-source vs. closed models, LLMaaS, LangChain, and LLMOps tools. Plus, we tackle challenges— ...
 
The Everyday AI podcast is a daily livestream, podcast and free newsletter where we help everyday people grow their careers with AI. The Everyday AI podcast is hosted by Jordan Wilson, a former journalist who's now the owner of a boutique digital strategy company with 20 years of martech experience. Our main focus is to help you keep up with AI trends to make your job easier. Get your work done faster. Increase your output. - Sign up for our free Prime Prompt Polish ChatGPT course: https://p ...
 
Software engineers, architects and team leads have found inspiration to drive change and innovation in their team by listening to the weekly InfoQ Podcast. They have received essential information that helped them validate their software development map. We have achieved that by interviewing some of the top CTOs, engineers and technology directors from companies like Uber, Netflix and more. Over 1,200,000 downloads in the last 3 years.
 
AWS Podcast

Amazon Web Services

Weekly
 
The Official AWS Podcast is a podcast for developers and IT professionals looking for the latest news and trends in storage, security, infrastructure, serverless, and more. Join Simon Elisha and Hawn Nguyen-Loughren for regular updates, deep dives, launches, and interviews. Whether you’re training machine learning models, developing open source projects, or building cloud solutions, the Official AWS Podcast has something for you.
 
Machine learning and artificial intelligence are dramatically changing the way businesses operate and people live. The TWIML AI Podcast brings the top minds and ideas from the world of ML and AI to a broad and influential community of ML/AI researchers, data scientists, engineers and tech-savvy business and IT leaders. Hosted by Sam Charrington, a sought-after industry analyst, speaker, commentator and thought leader. Technologies covered include machine learning, artificial intelligence, de ...
 
AXRP (pronounced axe-urp) is the AI X-risk Research Podcast where I, Daniel Filan, have conversations with researchers about their papers. We discuss the paper, and hopefully get a sense of why it's been written and how it might reduce the risk of AI causing an existential catastrophe: that is, permanently and drastically curtailing humanity's future potential. You can visit the website and read transcripts at axrp.net.
 
How can you measure ROI on GenAI for your team? 🤔 Internal evaluations and intentionality. We've helped thousands of orgs put LLMs to work and ACTUALLY save time. On today's show, we're dishing the 7 steps you need to follow. What’s the best LLM for your team? 7 Steps to evaluate and create ROI for AI -- An Everyday AI chat with Jordan Wilson Newsl…
 
In this podcast, InfoQ spoke with Elena Samuylova from Evidently AI, on best practices in evaluating Large Language Model (LLM) based applications. She also discussed the tools for evaluating, testing and monitoring applications powered by AI technologies. Read a transcript of this interview: https://bit.ly/4mHAKvN Subscribe to the Software Architec…
 
In the season finale of "All Things LLM," hosts Alex and Ben turn to one of the most important—and challenging—topics in AI: How do we objectively evaluate the quality and reliability of a language model? With so many models, benchmarks, and metrics, what actually counts as “good”? In this episode, you’ll discover: The evolution of LLM evaluation: …
 
Jeetu Patel knows a few AI secrets. As the President of one of the largest companies in the world, he's helped pave the AI adoption roadmap. At Cisco, they provide full-stack, enterprise AI solutions spanning infrastructure, security, observability, and operations to the world's largest companies. So naturally, Jeetu could write a legit playbook on…
 
You haven't used ChatGPT's Apps yet? 🫠 Oh.... you like wasting time? Even for free users, ChatGPT rolled out its new Apps mode that promises to shift the future of work. Don't know how to work it? Don't know where to start? Join us as we share 3 practical ways to start saving time today. ChatGPT Apps: 3 Hands-on approaches to save time today -- An …
 
In this episode, we're joined by Kunle Olukotun, professor of electrical engineering and computer science at Stanford University and co-founder and chief technologist at SambaNova Systems, to discuss reconfigurable dataflow architectures for AI inference. Kunle explains the core idea of building computers that are dynamically configured to match th…
 
Will this be AI's 'App Store Moment'? 🤔 OpenAI's Apps are live, and the consensus is split. Some are calling them a revolutionary step forward while others are saying it's another marketing flop. What's our hot take? Join us and find out. AI’s App Store Moment? Are ChatGPT’s Apps The Next Big Thing or Smoke and Mirrors? An Everyday AI Chat with Jor…
 
OpenAI debuted the future of ChatGPT with Agents and Apps. How will that impact work? 🤖 Google dropped Gemini for Enterprise. Does that make them the top AI option for the big players? 🏢 Everyone is talking about the AI bubble. Is it real and will it burst? 🫧 If you have questions over what's happening in the world of AI news, we've got answers. Jo…
 
In this episode of the AWS Podcast, host Jillian Forde discusses the migration journey of Booking.com to AWS with Ali and Sarah. They explore the challenges faced by Booking.com, the benefits of using CloudFront and Lambda at Edge, and the importance of observability and cost optimization. The conversation also delves into chaos engineering practi…
 
In this podcast, Michael Stiefel spoke with Nimisha Asthagiri about the importance of system thinking, multi-agent systems, the consequences of society applying a technology into an area for which it was not designed, and whether we can ever have a healthy relationship with artificial intelligence. System thinking emphasizes the importance of menta…
 
Breaking: Google just released Gemini Enterprise. 🚨 Will it be a ChatGPT or Microsoft Copilot killer? We got our hands on a version of the newest release and will break down everything you need to know, including the ONE feature that could ultimately set Gemini Enterprise apart. Google Gemini Enterprise: Coming for ChatGPT and Microsoft Copilot? --…
 
Have you been sleeping on NotebookLM? 😴 If so, you're leaving hours of productivity (and probably a lot of money) at the door. But real talk -- the team is shipping fast. The NotebookLM you met last year from the viral Audio Overviews is not the NotebookLM of today. It's slowly turned into a robust, multimedia powerhouse. And the last feature updat…
 
Today, we're joined by Jacob Buckman, co-founder and CEO of Manifest AI to discuss achieving long context in transformers. We discuss the bottlenecks of scaling context length and recent techniques to overcome them, including windowed attention, grouped query attention, and latent space attention. We explore the idea of weight-state balance and the…
 
Will this be the AI update that finally brings AI agents to millions? 🤔 Probably. OpenAI had a straight up feast of AI drops at its Dev Day conference, but one of the biggest was its drag-and-drop agent builder. Oh, and literally bringing entire website experiences into ChatGPT via apps. Don't miss this one. Ep 626: ChatGPT’s new Agent Builder, App…
 
OpenAI is dropping a visual agent builder 🤯 There's a HUGE report on AI job losses.... Sora 2 gets good and bad news... And that's just the beginning. Make sure to join us for this week's AI News That Matters! EP 625: Sora 2 release and update, OpenAI and AMD partner, ChatGPT Visual Agent Builder incoming and more Newsletter: Sign up for our free d…
 
If most companies are using the same AI systems, how can they stand out and get ahead? And as agentic AI becomes table stakes, what do enterprises need to keep in mind to make AI work? And how can we even trust an AI-powered workplace when most people can't even explain the basics of AI? We're learning from the experts. Accenture's Mary Hamilton jo…
 
Everyone's talking about Sora 2. 🗣️ - How good it is - How it's going after TikTok as a social media app - The downsides of cyclical AI brain rot But, you're missing the big upside. On today's show, we're going to tell you how businesses should be focusing on the untapped potential of Sora 2. Want to enter our Sora code giveaway? Go repost today's …
 
You definitely missed this AI drop 👇 AI functions are live inside of Google Sheets thanks to Gemini. I know what you're saying... AI functions? I can barely spread the sheets! That's the beauty of it. With a simple =AI of your keyboard, you can talk to your spreadsheets in natural language and tap into the full power of Google Gemini. Don't worry..…
 
In this episode, Illia Polosukhin, a co-author of the seminal "Attention Is All You Need" paper and co-founder of Near AI, joins us to discuss his vision for building private, decentralized, and user-owned AI. Illia shares his unique journey from developing the Transformer architecture at Google to building the NEAR Protocol blockchain to solve glo…
 
Is this the AI agent we've all been waiting for? 🤔 Maybe. Microsoft just unveiled their highly capable Agent Mode and Office Agent. Capabilities? Through the roof. Execution, rollout and availability? ummmm...... Join us as we cut through the fluff on these new AI agents from Microsoft and separate the game-changing features from the shiny marketin…
 
Is Apple serious about AI now with a new internal model? 🤔 Why did Accenture lay off 11,000 and how many more will go due to AI? How the heck is OpenAI signing so many multi-billion dollar partnerships? So many AI questions. We've got the AI answers. Don't waste hours each week trying to keep up with the AI news. We do that for you with our Mondays…
 
Software supply chain veteran Brian Fox unpacks the security implications of the new EU Cyber Resilience Act and its profound impact on open-source projects. He reveals the hidden infrastructure risks threatening open-source projects and shares insights for senior software leaders navigating this regulatory landscape. Read a transcript of this inter…
 
Everyone knows Google's Nano Banana is bonkers good. 🍌 But did you know you can create an app in minutes that embeds Nano Banana.... and it takes zero coding experience?! 🤯 If you haven't used Google's Gemini 2.5 Flash (AKA Nano Banana), you're in for a treat as Google's Paige Bailey gives us the insider's guide. Nano Banana Uncovered: A practical g…
 
AI scraping vs. the open web. Who wins? 🥊 Let's say the quiet part out loud: AI companies have trained their models for years on your company's website data, regardless of whether you want them to. Fast forward to today: many publishers have lost up to 70% of website traffic (and huge chunks of revenue) because of this. So what happens if some of these …
 
AI is coming for the world's most popular browser. 🤖 Google recently announced Gemini in Chrome, bringing AI (and eventually agentic) features to the browser used by more than 3 billion people. We've been using Gemini in Chrome for a few months, so we're recapping some of our favorite features, tips and 6 easy use cases that you can take advantage …
 
In this episode of the podcast, members of the InfoQ editorial staff and friends of InfoQ discuss the current trends in the domain of AI, ML and Data Engineering. One of the regular features of InfoQ is the trends reports, which each focus on a different aspect of software development. These reports provide the InfoQ readers and listeners with a hi…
 
Today, we’re joined by Oliver Wang, principal scientist at Google DeepMind and tech lead for Gemini 2.5 Flash Image—better known by its code name, “Nano Banana.” We dive into the development and capabilities of this newly released frontier vision-language model, beginning with the broader shift from specialized image generators to general-purpose m…
 
The headline was kinda shocking 👇 "Satya Nadella is haunted at the prospect of Microsoft not surviving the AI era" This was a story in The Verge that detailed an internal Microsoft town hall and its CEO's kinda urgent plea on Microsoft keeping pace. But real talk now..... if one of the key companies pushing AI innovation is kinda haunted by keeping…
 
OpenAI is now going after .... Apple? 🍎 Google is finally bringing more AI to its browser. 🖥️ And Microsoft's CEO has cast a kinda gloomy warning on AI. ⚠️ And that's just the beginning. Don't waste hours each day trying to keep up with AI. That's what we do. OpenAI going after Apple hardware, Google brings Gemini to Chrome, Microsoft CEO’s dire wa…
 
Make way for the next wave of GenAI..... Agentic AI Browsers. And while we've seen rumors that OpenAI is going all-in on an AI browser, the first big player is already here in Perplexity's Comet Browser. Join us as we break down how Perplexity Comet works, what makes it different, and 5 Business Use-Cases for ROI. Newsletter: Sign up for our free d…
 
What happens when AI not only understands the world, but acts in it? In this trailblazing episode of "All Things LLM," Alex and Ben chart the rise of next-generation AI: autonomous agents and Large Action Models (LAMs). Discover how LLMs are evolving from passive text generators to powerful doers—reshaping workflows, business automation, and the ve…
 
In the grand finale of "All Things LLM," hosts Alex and Ben look ahead to the bleeding edge—and reflect on the ultimate question for AI: can we ever truly understand how these models think? Inside this episode: The rise of reasoning models: Discover why the next leap for AI isn’t just bigger models, but smarter thinking. Explore how OpenAI’s o1 and…
 
AI’s next great leap isn’t about bigger models—it’s about broader senses. In this season premiere of "All Things LLM," Alex and Ben explore the revolutionary world of multimodal large language models (LLMs)—the new frontier where AI can “see,” “hear,” and “understand” the world far beyond text. In this episode: Journey to Multimodality: Discover wh…
 
As LLMs power more business workflows, security risks grow. In this essential episode of "All Things LLM," hosts Alex and Ben break down the new wave of cybersecurity threats targeting language models—and what you can do to defend your AI infrastructure. What you’ll learn: The OWASP Top 10 for LLMs: Explore the most pressing LLM security risks and …
 
Season 4 of "All Things LLM" kicks off with one of the most crucial debates in AI today: open-source vs. closed-source (proprietary) language models. Hosts Alex and Ben cut through the hype to explain what’s at stake for businesses, developers, and the entire AI ecosystem. In this episode, you’ll discover: The fundamentals: What truly sets open-sou…
 
Powerful language models are reshaping the world, but serious challenges remain. In this revealing episode of "All Things LLM," hosts Alex and AI expert Ben tackle the core limitations and ethical risks facing all large language models—open or closed. This episode covers: Hallucinations: Why LLMs make up plausible-sounding but false or misleading a…
 
How do we make AI not just smart, but safe and genuinely helpful? In this episode of "All Things LLM," Alex and Ben break down the vital process of alignment—transforming a powerful language model into a trustworthy assistant you can rely on. Inside this episode: What is RLHF? Discover Reinforcement Learning from Human Feedback—the multi-stage proc…
 
Get an insider’s look behind the curtain of modern AI with this episode of "All Things LLM." Join hosts Alex and AI expert Ben as they reveal the colossal effort, expense, and ingenuity required to take a language model from “blank slate” to foundational intelligence. What you’ll learn: The massive scale of LLM training: how developers assemble and…
 
Unlock the key to modern AI with this deep-dive episode of "All Things LLM"! Hosts Alex and our resident AI expert Ben unpack the “self-attention mechanism”—the heart of every powerful Transformer model powering GPT, Llama, Gemini, and more. Discover: What “self-attention” actually means in the context of language models—and why it’s a game-changer…
 
Unlock the full potential of large language models with this hands-on episode of "All Things LLM." Hosts Alex and AI expert Ben break down the essential (and rapidly evolving) discipline of prompt engineering—your steering wheel for directing AI toward more relevant, accurate, and actionable outputs. What you’ll learn: Prompt Engineering 101: Why c…
 
Discover how generalist AIs become powerful specialists in this episode of "All Things LLM." Hosts Alex and AI expert Ben break down the next stage of the LLM lifecycle—customization—and unpack the practical techniques that transform foundation models into domain experts or business-ready assistants. Learn about: Fine-Tuning: Why it’s essential for…
 
Unlock the mysteries of modern AI with "All Things LLM." In this episode, Alex and Ben break down the Transformer—the revolutionary engine powering today’s Large Language Models (LLMs) like GPT-4, Llama, and Gemini. If you’ve ever wondered how AI can both understand and generate text, this deep dive into Transformer architecture is your essential g…
 
Curious how AI language models like ChatGPT burst into the mainstream? Welcome to "All Things LLM," where hosts Alex and AI expert Ben unravel the true origins and evolution of Large Language Models. In this episode, we journey through more than a century of discoveries that paved the way for today’s groundbreaking AI. Discover: The surprising root…
 
Copyright 2025