Friday, April 18, 2025

How LLMs Work, by Andrej Karpathy

Andrej Karpathy, Eureka Labs founder and computer scientist (Tesla, OpenAI), explains how language models work, and are built. You'll need about 3.5 hours to view the whole video, but it covers transformer networks, training (human and algorithmic; labeling) and reinforcement learning. 

No comments:

Yes, Follow the Data. Even if it Does Not Fit Your Agenda

When people argue we need to “follow the science” that should be true in all cases, not only in cases where the data fits one’s political pr...