Friday, February 14, 2025

DeepSeek Might be an Example of Continuity More than We Think

Perhaps I will change my mind as I learn more, but right now DeepSeek has added some clever innovation to language model training and inference costs. But other key contestants seem to be working in parallel.


Watch for coming releases from Anthropic and OpenAI, for example. And other developers have been working on similar approaches to DeepSeek. Gemini, for example, uses the “Mixture of Experts” approach also used by DeepSeek. 


The point is that model training and inference costs already were dropping fast, as is typical for all computing processes. 


source: Bain

No comments:

Yes, Follow the Data. Even if it Does Not Fit Your Agenda

When people argue we need to “follow the science” that should be true in all cases, not only in cases where the data fits one’s political pr...