Andrej Karpathy, Eureka Labs founder and computer scientist (Tesla, OpenAI), explains how language models work, and are built. You'll need about 3.5 hours to view the whole video, but it covers transformer networks, training (human and algorithmic; labeling) and reinforcement learning.
Friday, April 18, 2025
How LLMs Work, by Andrej Karpathy

Subscribe to:
Post Comments (Atom)
Amazon, Alphabet, Meta, Microsoft Capex is 3.5% of Global Total
In one sense, capital investment in data centers and artificial intelligence by Amazon, Alphabet, Meta and Microsoft represents only about 3...
-
We have all repeatedly seen comparisons of equity value of hyperscale app providers compared to the value of connectivity providers, which s...
-
It really is surprising how often a Pareto distribution--the “80/20 rule--appears in business life, or in life, generally. Basically, the...
-
One recurring issue with forecasts of multi-access edge computing is that it is easier to make predictions about cost than revenue and infra...
No comments:
Post a Comment