Friday, April 18, 2025

How LLMs Work, by Andrej Karpathy

Andrej Karpathy, Eureka Labs founder and computer scientist (Tesla, OpenAI), explains how language models work and how they are built. You'll need about 3.5 hours to view the whole video, which covers transformer networks, training (human and algorithmic; labeling), and reinforcement learning.

Anthropic Strategy: Productivity Platform

Anthropic’s (Claude) likely strategy is to evolve from a pure AI model/API provider into a fully integrated, end-to-end AI productivity plat...