Friday, April 18, 2025

How LLMs Work, by Andrej Karpathy

Andrej Karpathy, Eureka Labs founder and computer scientist (Tesla, OpenAI), explains how language models work, and are built. You'll need about 3.5 hours to view the whole video, but it covers transformer networks, training (human and algorithmic; labeling) and reinforcement learning. 

No comments:

Could Court-Ordered End to Google Search "Exclusive Placement" Actually be Good for Google?

One of the assertions in the “United States v. Google” 2020 antitrust case against Google is that Google acts as a monopolist in paying $26 ...