Content provided by NPR. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by NPR or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://staging.podcastplayer.com/legal.
When AI Cannibalizes Its Data

13:23
 
Asked ChatGPT anything lately? Talked with a customer service chatbot? Read the results of Google's "AI Overviews" summary feature? If you've used the Internet lately, chances are, you've consumed content created by a large language model. These models, like DeepSeek-R1 or OpenAI's ChatGPT, are kind of like the predictive text feature in your phone on steroids. In order for them to "learn" how to write, the models are trained on millions of examples of human-written text. Thanks in part to these same large language models, a lot of content on the Internet today is written by generative AI. That means that AI models trained nowadays may be consuming their own synthetic content ... and suffering the consequences.
View the AI-generated images mentioned in this episode.
Have another topic in artificial intelligence you want us to cover? Let us know by emailing [email protected]!
Listen to every episode of Short Wave sponsor-free and support our work at NPR by signing up for Short Wave+ at plus.npr.org/shortwave.
Learn more about sponsor message choices: podcastchoices.com/adchoices
NPR Privacy Policy

1280 episodes


Short Wave

1,355 subscribers

