Multimodal Few-Shot Learning with Frozen Language Models | Paper Explained

❤️ Become The AI Epiphany Patreon ❤️ ► In this video I cover “Multimodal Few-Shot Learning with Frozen Language Models“ from DeepMind. They introduce Frozen - which is able to handle both visual and textual inputs and shows good generalization capabilities to novel visual question answering datasets combined with fast binding mechanisms even though it was only trained on image captioning. ▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬ ✅ Paper: ▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬ ⌚️ Timetable: 00:00 Intro 02:20 GPT-3 and emerging few-shot properties 04:20 Training procedure for Frozen
Back to Top