October 25th, 2023 - Pixel to Perception: Matryoshka Synthesis, GPT-3's Linguistic Mysteries, Woodpecker's Visual Refinement, and SAM-CLIP's Vision Evolution
MP3•Episode home
Manage episode 380873645 series 3485608
Content provided by Marcus Edel. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Marcus Edel or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://staging.podcastplayer.com/legal.
…
continue reading
Chapters
1. Intro (00:00:00)
2. Matryoshka Diffusion Models (00:01:12)
3. Dissecting In-Context Learning of Translations in GPTs (00:04:51)
4. Woodpecker: Hallucination Correction for Multimodal Large Language Models (00:06:07)
5. SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding (00:08:25)
75 episodes