Themes - DeepSeek, Kimi And AI Efficiency Paradigm Shift (Pt.1)
Summary
* DeepSeek is a true innovator, not a copycat. Its model architecture, algorithms, training framework, and inference framework are all industry firsts.
* According to its own disclosure, DeepSeek did not use smuggled H100s; it used just 2,048 H800s.
* In the near term, it should continue to use NVDA cards, but over the longer term it has a good chance of building its own training library and adopting other AI ASICs, such as Huawei's Ascend AI chips.
* DeepSeek is shifting GPU demand from memory-bound to compute-bound.
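
For readers less familiar with the memory-bound versus compute-bound distinction in the last bullet, here is a minimal roofline-style sketch. The hardware figures (`PEAK_FLOPS`, `MEM_BANDWIDTH`) and the two workload intensities are illustrative assumptions, not the specs of any real GPU or any DeepSeek measurement, and the function names are hypothetical.

```python
# Roofline-style sketch: a workload is memory-bound when its arithmetic
# intensity (FLOPs per byte moved) is below the hardware "ridge point",
# and compute-bound when it is at or above it.

# Illustrative, made-up hardware numbers (NOT a real GPU spec).
PEAK_FLOPS = 1.0e15      # peak compute, FLOP/s
MEM_BANDWIDTH = 3.0e12   # memory bandwidth, bytes/s


def ridge_point(peak_flops: float, mem_bandwidth: float) -> float:
    """Arithmetic intensity (FLOP/byte) at which compute becomes the limit."""
    return peak_flops / mem_bandwidth


def attainable_flops(intensity: float, peak_flops: float, mem_bandwidth: float) -> float:
    """Upper bound on throughput: limited by either bandwidth or peak compute."""
    return min(peak_flops, intensity * mem_bandwidth)


def classify(intensity: float, peak_flops: float, mem_bandwidth: float) -> str:
    """Label a workload by which resource caps its throughput."""
    if intensity >= ridge_point(peak_flops, mem_bandwidth):
        return "compute-bound"
    return "memory-bound"


# Two hypothetical workloads, chosen only to land on either side of the ridge.
low_intensity = 1.0      # FLOP/byte
high_intensity = 500.0   # FLOP/byte

low_label = classify(low_intensity, PEAK_FLOPS, MEM_BANDWIDTH)
high_label = classify(high_intensity, PEAK_FLOPS, MEM_BANDWIDTH)
```

In this toy setup the low-intensity workload is capped by bandwidth and the high-intensity one by peak compute, which is the sense in which a change in workload mix can move demand from memory toward compute.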