Themes - DeepSeek, Kimi And AI Efficiency Paradigm Shift (Pt.1)

Summary

* DeepSeek is a true innovator, not a copycat. Its model architecture, algorithms, and training and inference frameworks are all industry firsts.
* DeepSeek hasn't used smuggled H100s, just 2048 H800s, as per its own disclosure.
* In the near term, it should continue to use NVDA cards, but over the longer term it has a good chance of building its own training library and moving to other AI ASICs, such as Huawei's Ascend AI chips.
* DeepSeek is shifting GPU demand from memory-bound to compute-bound.

23 min read 10 Feb 2025
