In this video, we implement the GPT-2 model from scratch. We focus only on inference, not on the training logic. We cover concepts such as self-attention, decoder blocks, and generating new tokens.
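To give a rough idea of what "generating new tokens" means, here is a minimal sketch of an autoregressive loop. It is not the code from the video: it uses greedy decoding (always picking the highest-scoring token), whereas a real implementation may sample instead, and `toy_model` is a hypothetical stand-in for the GPT-2 forward pass.

```python
from typing import Callable, List


def generate(
    next_token_logits: Callable[[List[int]], List[float]],
    token_ids: List[int],
    n_new_tokens: int,
) -> List[int]:
    """Greedy autoregressive decoding: append one token at a time."""
    ids = list(token_ids)  # copy so the caller's list is not modified
    for _ in range(n_new_tokens):
        logits = next_token_logits(ids)  # one score per vocabulary entry
        next_id = max(range(len(logits)), key=logits.__getitem__)  # argmax
        ids.append(next_id)  # the new token becomes part of the next input
    return ids


def toy_model(ids: List[int]) -> List[float]:
    """Hypothetical model over a 5-token vocabulary.

    It always favours (last token + 1) modulo the vocabulary size.
    """
    vocab_size = 5
    target = (ids[-1] + 1) % vocab_size
    return [1.0 if i == target else 0.0 for i in range(vocab_size)]


print(generate(toy_model, [3], 4))
```

The important part is the feedback loop: every generated token is appended to the input before the next forward pass.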
Paper:
Code minGPT:
Code transformers: #L946
Code from the video:
00:00 Intro
01:32 Overview: Main goal [slides]
02:06 Overview: Forward pass [slides]
03:39 Overview: GPT module (part 1) [slides]
04:28 Overview: GPT module (part 2) [slides]
05:25 Overview: Decoder block [slides]
06:10 Overview: Masked self attention [slides]
07:52 Decoder module [code]
13:40 GPT module [code]
18:19 Copying a tensor [code]
19:26 Copying a Decoder module [code]
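As a companion to the "Masked self attention" chapter, here is a minimal single-head sketch in plain Python. It is an illustration under simplifying assumptions, not the video's code: the function name `causal_attention` is hypothetical, queries, keys and values are passed in directly rather than computed by learned projections, and there is no batching or multi-head splitting.

```python
import math
from typing import List

Vector = List[float]


def softmax(xs: Vector) -> Vector:
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def causal_attention(q: List[Vector], k: List[Vector], v: List[Vector]) -> List[Vector]:
    """Scaled dot-product attention where position i only sees positions 0..i."""
    d = len(q[0])
    out = []
    for i in range(len(q)):
        # Masking: scores are computed only for j <= i, so future tokens
        # contribute nothing to the output at position i.
        scores = [
            sum(a * b for a, b in zip(q[i], k[j])) / math.sqrt(d)
            for j in range(i + 1)
        ]
        weights = softmax(scores)
        # Weighted average of the visible value vectors.
        out.append([
            sum(w * v[j][c] for j, w in enumerate(weights))
            for c in range(len(v[0]))
        ])
    return out


q = k = [[1.0, 0.0], [0.0, 1.0]]
v = [[1.0, 2.0], [3.0, 4.0]]
print(causal_attention(q, k, v))
```

The first position can only attend to itself, so its output equals the first value vector; the second position mixes both value vectors.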