DeepSeek releases DeepGEMM: an efficient FP8 GEMM library that optimizes V3/R1 training and inference

PANews reported on February 26 that DeepSeek launched DeepGEMM on the third day of its OpenSourceWeek: a CUDA library for FP8 general matrix multiplication (GEMM) that supports both dense layouts and mixture-of-experts (MoE) architectures, optimizing training and inference for the V3/R1 models.

DeepGEMM key features:

• Ultra-high performance: over 1350 FP8 TFLOPS on Hopper GPUs

• Minimal dependencies: no heavy dependencies; the code is as simple as a tutorial

• JIT compilation: no pre-compilation needed; kernels are optimized automatically at runtime

• Compact: the core code is only about 300 lines, yet it outperforms expert-tuned kernels for most matrix sizes

• Layout support: dense layout and two MoE layouts
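The basic idea behind an FP8 GEMM is to store the input matrices in a low-precision 8-bit float format (such as E4M3, whose largest representable magnitude is 448) together with scaling factors, multiply in low precision, accumulate in higher precision, and rescale the result. The sketch below simulates that recipe in NumPy with per-tensor scales and 3-bit mantissa rounding; it is an illustrative approximation of the general technique, not DeepGEMM's actual CUDA API (the function names `quantize_fp8_e4m3` and `fp8_gemm` are invented for this example, and real kernels use finer-grained block scaling).

```python
import numpy as np

FP8_E4M3_MAX = 448.0  # largest finite magnitude in the E4M3 format


def round_to_e4m3(x):
    """Coarsely simulate E4M3 mantissa precision: keep 3 explicit
    mantissa bits (subnormals and saturation edge cases are ignored)."""
    m, e = np.frexp(x)                  # x = m * 2**e, |m| in [0.5, 1)
    m = np.round(m * 16.0) / 16.0       # quantize mantissa to 1+3 bits
    return np.ldexp(m, e)


def quantize_fp8_e4m3(x):
    """Scale a float32 tensor into the E4M3 range with one per-tensor
    scale, then round to simulated FP8 precision."""
    scale = np.abs(x).max() / FP8_E4M3_MAX  # assumes x is not all zeros
    q = round_to_e4m3(np.clip(x / scale, -FP8_E4M3_MAX, FP8_E4M3_MAX))
    return q.astype(np.float32), scale


def fp8_gemm(a, b):
    """Multiply via the FP8-quantized operands, accumulating in float32
    and rescaling the product by both scales."""
    qa, sa = quantize_fp8_e4m3(a)
    qb, sb = quantize_fp8_e4m3(b)
    return (qa @ qb) * (sa * sb)


rng = np.random.default_rng(0)
a = rng.standard_normal((64, 128)).astype(np.float32)
b = rng.standard_normal((128, 32)).astype(np.float32)

out = fp8_gemm(a, b)
ref = a @ b
# The low-precision result tracks the float32 reference closely.
print("relative error:", np.linalg.norm(out - ref) / np.linalg.norm(ref))
```

With only 8 bits per element, the quantization error per product term is a few percent, but the errors largely cancel in the accumulation, which is why FP8 GEMM remains accurate enough for training when paired with higher-precision accumulation.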


Author: PA一线

This content is for market information only and is not investment advice.
