From bf880385d0a6aca2b68ee7d59862c9b5b1369ef0 Mon Sep 17 00:00:00 2001 From: hpp Date: Thu, 27 Feb 2025 10:39:41 +0800 Subject: [PATCH] add day 4 --- README.md | 9 +++++++++ 1 file changed, 9 insertions(+) diff --git a/README.md b/README.md index 8a65444..e241363 100644 --- a/README.md +++ b/README.md @@ -56,6 +56,15 @@ Introducing DeepGEMM - an FP8 GEMM library that supports both dense and MoE GEMM 🔗 GitHub: https://github.com/deepseek-ai/DeepGEMM +### Day 4 - Optimized Parallelism Strategies +✅ DualPipe - a bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training. +🔗 https://github.com/deepseek-ai/DualPipe + +✅ EPLB - an expert-parallel load balancer for V3/R1. + 🔗 https://github.com/deepseek-ai/eplb + +📊 Analyze computation-communication overlap in V3/R1. + 🔗 https://github.com/deepseek-ai/profile-data ### Ongoing Releases...