PANews reported on January 21 that, according to QuantumBit, the name "MODEL1" has appeared for the first time in DeepSeek's updated FlashMLA code on GitHub. It is mentioned 28 times across 114 files and is listed alongside the existing V32 (DeepSeek-V3.2), which suggests that MODEL1 is a model built on a next-generation architecture. Code differences indicate that the model has been optimized in areas such as KV cache layout, sparsity handling, and FP8 decoding, and it may be officially released around the Spring Festival. Combined with the recently disclosed mHC residual connection mechanism and Engram memory module, MODEL1 is expected to integrate several of DeepSeek's self-developed innovations.
Code for DeepSeek's new model, MODEL1, has leaked, suggesting a completely new architecture.
Author: PA一线
This content is for market information only and is not investment advice.

