Friday, February 14, 2025

DeepSeek Might be an Example of Continuity More than We Think

Perhaps I will change my mind as I learn more, but right now DeepSeek has added some clever innovation to language model training and inference costs. But other key contestants seem to be working in parallel.


Watch for coming releases from Anthropic and OpenAI, for example. And other developers have been working on similar approaches to DeepSeek. Gemini, for example, uses the “Mixture of Experts” approach also used by DeepSeek. 


The point is that model training and inference costs already were dropping fast, as is typical for all computing processes. 


source: Bain

No comments:

What "Boomers" Messed Up

Journalist Helen Andrews is not going to be popular with lots of readers of her book Boomers: The Men and Women Who Promised Freedom and De...