Artwork
iconShare
 
Manage episode 388370178 series 3402048
Content provided by Joe Carlsmith. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Joe Carlsmith or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://staging.podcastplayer.com/legal.
  continue reading

Chapters

1. Speed arguments against scheming (Section 4.4-4.7 of "Scheming AIs") (00:00:00)

2. 4.4 Speed arguments (00:00:29)

3. 4.4.1 How big are the absolute costs of this extra reasoning? (00:02:22)

4. 4.4.2 How big are the costs of this extra reasoning relative to the simplicity benefits of (00:07:06)

5. 4.4.3 Can we actively shape training to bias towards speed over simplicity? (00:09:21)

6. 4.5 The “not-your-passion” argument (00:10:27)

7. 4.6 The relevance of “slack” to these arguments (00:12:46)

8. 4.7 Takeaways re: arguments that focus on the final properties of the model (00:13:38)

67 episodes