6c3* | Let's build GPT: from scratch, in code, spelled out. - Andrej Karpathy
https://www.youtube.com/watch?v=kCc8FmEb1nY
Saved on 2023-05-12 [19489 edays] via youtube.com
Modified 2023-08-19 [19588 edays]
ai presentations programming python

We build a Generatively Pretrained Transformer (GPT), following the paper "Attention is All You Need" and OpenAI's GPT-2 / GPT-3. We talk about connections to ChatGPT, which has taken the world by storm. We watch GitHub Copilot, itself a GPT, help us write a GPT (meta :D!) . I recommend people watch the earlier makemore videos to get comfortable with the autoregressive language modeling framework and basics of tensors and PyTorch nn, which we take for granted in this video.