deepseek-infra-index/README.md
2025-02-24 09:31:16 +08:00

41 lines
No EOL
1.7 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

<!-- markdownlint-disable first-line-h1 -->
<!-- markdownlint-disable html -->
<!-- markdownlint-disable no-duplicate-header -->
<div align="center">
<img src="https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/logo.svg?raw=true" width="60%" alt="DeepSeek-Open-Infra" />
</div>
<hr>
# Hello, DeepSeek Open Infra!
## 202502 Open-Source Week
We're a tiny team @deepseek-ai pushing our limits in AGI exploration.
Starting **this week** , Feb 24, 2025 we'll open-source 5 repos one daily drop not because we've made grand claims,
but simply as developers sharing our small-but-sincere progress with full transparency.
These are humble building blocks of our online service: documented, deployed and battle-tested in production.
No vaporware, just sincere code that moved our tiny yet ambitious dream forward.
Why? Because every line shared becomes collective momentum that accelerates the journey.
Daily unlocks begin soon. No ivory towers - just pure garage-energy and community-driven innovation 🔧
Stay tuned let's geek out in the open together.
### Day 1 - FlashMLA
**Efficient MLA Decoding Kernel for Hopper GPUs**
Optimized for variable-length sequences, battle-tested in production
🔗 <a href="https://github.com/deepseek-ai/FlashMLA"><b>GitHub Repo</b></a>
✅ BF16 support
✅ Paged KV cache (block size 64)
⚡ Performance: 3000 GB/s memory-bound | BF16 580 TFLOPS compute-bound on H800
### Ongoing Releases...
## 2024 AI Infrastructure Paper (SC24)
### Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep Learning
<a href="https://dl.acm.org/doi/10.1109/SC41406.2024.00089"><b>📄 Paper Link</b></a>
<a href="https://arxiv.org/abs/2408.14158"><b>📄 Arxiv Paper Link</b></a>