Search a title or topic

Over 20 million podcasts, powered by 

Player FM logo
Artwork

Content provided by Enough About AI. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Enough About AI or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://staging.podcastplayer.com/legal.
Player FM - Podcast App
Go offline with the Player FM app!

Alignment Anxieties & Persuasion Problems

46:48
 
Share
 

Manage episode 482511791 series 3613033
Content provided by Enough About AI. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Enough About AI or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://staging.podcastplayer.com/legal.

Dónal and Ciarán continue the 2025 season with a second quarterly update that looks at some recent themes in AI development. They're pondering doom again, as we increasingly grapple with the evidence that AI systems are powerfully persuasive and full of flattery at the same time as our ability to meaningfully supervise them seems to be diminishing.

Topics in this episode

  • Can we see how reasoning models reason? If AI is thinking, or sharing information and it's not in human language, how can we check that it's aligned with our values.
  • This interpretability issue is tied to the concept of neuralese - inscrutable machine thoughts!
  • We discuss the predictions and prophetic doom visions of the AI-2027 document
  • Increasing ubiquity and sometimes invisibility of AI, as it's inserted into other products. Is this more enshittification?
  • AI is becoming a persuasion machine - we look at the recent issues on Reddit's r/ChangeMyView, where researchers skipped good ethics practice but ended up with worrying results
  • We talk about flattery, manipulation, and Eli Yudkowsky's AI-Box thought experiment

Resources & Links

You can get in touch with us - [email protected] - where we'd love to hear your questions, comments or suggestions!

  continue reading

8 episodes

Artwork
iconShare
 
Manage episode 482511791 series 3613033
Content provided by Enough About AI. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Enough About AI or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://staging.podcastplayer.com/legal.

Dónal and Ciarán continue the 2025 season with a second quarterly update that looks at some recent themes in AI development. They're pondering doom again, as we increasingly grapple with the evidence that AI systems are powerfully persuasive and full of flattery at the same time as our ability to meaningfully supervise them seems to be diminishing.

Topics in this episode

  • Can we see how reasoning models reason? If AI is thinking, or sharing information and it's not in human language, how can we check that it's aligned with our values.
  • This interpretability issue is tied to the concept of neuralese - inscrutable machine thoughts!
  • We discuss the predictions and prophetic doom visions of the AI-2027 document
  • Increasing ubiquity and sometimes invisibility of AI, as it's inserted into other products. Is this more enshittification?
  • AI is becoming a persuasion machine - we look at the recent issues on Reddit's r/ChangeMyView, where researchers skipped good ethics practice but ended up with worrying results
  • We talk about flattery, manipulation, and Eli Yudkowsky's AI-Box thought experiment

Resources & Links

You can get in touch with us - [email protected] - where we'd love to hear your questions, comments or suggestions!

  continue reading

8 episodes

All episodes

×
 
Loading …

Welcome to Player FM!

Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.

 

Listen to this show while you explore
Play