Andrej Karpathy, Eureka Labs founder and computer scientist (Tesla, OpenAI), explains how language models work and how they are built. The whole video runs about 3.5 hours, but it covers transformer networks, training (human and algorithmic; labeling) and reinforcement learning.
Friday, April 18, 2025
How LLMs Work, by Andrej Karpathy
