Vision-Language Pre-Trained Models. We'll take a close look at Flamingo, BLIP-2, LLaVA, and LLaVA-1.5
Take a break from the pre-New Year rush and set aside one evening for learning: VK Lab is hosting a seminar on December 19 at 20:00.
Our intern Daniil Belopolskikh will talk about multimodal models, specifically Vision-Language Pre-Trained Models. We'll take a close look at Flamingo, BLIP-2, LLaVA, and LLaVA-1.5.
You'll also learn:
- why it is hard to make images and text work together;
- which datasets are needed to train such models;
- how to compare these models.
We'll be sure to answer your questions at the end of the seminar. Join us!