mirror of
https://github.com/deepseek-ai/open-infra-index.git
synced 2025-04-03 16:54:04 +00:00
Update README.md
This commit is contained in:
parent
be36f16c7f
commit
b4998cec46
1 changed files with 4 additions and 0 deletions
|
@ -49,9 +49,13 @@ Excited to introduce **DeepEP** - the first open-source EP communication library
|
||||||
Introducing DeepGEMM - an FP8 GEMM library that supports both dense and MoE GEMMs, powering V3/R1 training and inference.
|
Introducing DeepGEMM - an FP8 GEMM library that supports both dense and MoE GEMMs, powering V3/R1 training and inference.
|
||||||
|
|
||||||
⚡ Up to 1350+ FP8 TFLOPS on Hopper GPUs
|
⚡ Up to 1350+ FP8 TFLOPS on Hopper GPUs
|
||||||
|
|
||||||
✅ No heavy dependency, as clean as a tutorial
|
✅ No heavy dependency, as clean as a tutorial
|
||||||
|
|
||||||
✅ Fully Just-In-Time compiled
|
✅ Fully Just-In-Time compiled
|
||||||
|
|
||||||
✅ Core logic at ~300 lines - yet outperforms expert-tuned kernels across most matrix sizes
|
✅ Core logic at ~300 lines - yet outperforms expert-tuned kernels across most matrix sizes
|
||||||
|
|
||||||
✅ Supports dense layout and two MoE layouts
|
✅ Supports dense layout and two MoE layouts
|
||||||
|
|
||||||
🔗 GitHub: https://github.com/deepseek-ai/DeepGEMM
|
🔗 GitHub: https://github.com/deepseek-ai/DeepGEMM
|
||||||
|
|
Loading…
Add table
Reference in a new issue