Pre-training LLMs: One Model To Rule Them All? with Talfan Evans, DeepMind
Talfan Evans is a research engineer at DeepMind, where he focuses on data curation and foundational research for pre-training LLMs and multimodal models like Gemini. I ask Talfan:
- Will one model rule them all?
- What does "high quality data" actually mean in the context of LLM training?
- Is language model pre-training becoming commoditized?
- Are companies like Google and OpenAI keeping their AI secrets to themselves?
- Do startups and the open-source community stand a chance against the giants?
Also check out Talfan's latest paper at DeepMind, "Bad Students Make Great Teachers".