Friday, April 18, 2025

How LLMs Work, by Andrej Karpathy

Andrej Karpathy, Eureka Labs founder and computer scientist (Tesla, OpenAI), explains how language models work and how they are built. The whole video takes about 3.5 hours to watch, and it covers transformer networks, training (human and algorithmic; labeling), and reinforcement learning.

Amazon, Alphabet, Meta, Microsoft Capex is 3.5% of Global Total

In one sense, capital investment in data centers and artificial intelligence by Amazon, Alphabet, Meta and Microsoft represents only about 3...