Perhaps I will change my mind as I learn more, but right now DeepSeek has added some clever innovation to language model training and inference costs. But other key contestants seem to be working in parallel.
Watch for coming releases from Anthropic and OpenAI, for example. And other developers have been working on similar approaches to DeepSeek. Gemini, for example, uses the “Mixture of Experts” approach also used by DeepSeek.
The point is that model training and inference costs already were dropping fast, as is typical for all computing processes.
source: Bain
No comments:
Post a Comment