problems/amd_distributed/all2all/task.yml
Lines changed: 1 addition & 1 deletion
@@ -15,7 +15,7 @@ config:
 
   description: |
 
-    You will implement a custom single node all2all kernel optimized for 8xMI300.
+    You are expected to implement dispatch, simulated MoE, and combine kernels with intra-node communication, referring to reference.py; together these make up a custom single-node all2all kernel optimized for 8xMI300.
     You will be given MoEConfig, the main hyperparameter object, which includes the number of experts, experts per token, hidden dim, the max number of tokens per DP rank, and the input/output dtypes.
 
     To be explicit, you will be given the data of all ranks, named all_rank_data.
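For orientation, the sketch below illustrates what a MoEConfig and the dispatch / simulated-MoE / combine flow described in the new task text might look like in Python. All field and function names here are assumptions for illustration only; the authoritative definitions are in reference.py and task.yml.

```python
# Illustrative sketch only -- field and function names are assumed,
# not taken from reference.py.
from dataclasses import dataclass
import torch

@dataclass
class MoEConfig:
    num_experts: int          # total experts across the node (assumed name)
    experts_per_token: int    # top-k experts each token routes to (assumed name)
    hidden_dim: int           # hidden dimension of token activations (assumed name)
    max_num_tokens: int       # max tokens produced by each DP rank (assumed name)
    in_dtype: torch.dtype     # input dtype (assumed name)
    out_dtype: torch.dtype    # output dtype (assumed name)

def all2all(cfg: MoEConfig, all_rank_data, rank: int, world_size: int):
    """Hypothetical outline of the three stages the task description asks for."""
    # 1. dispatch: route each token to the rank(s) hosting its selected experts
    #    (intra-node communication across the 8 MI300 GPUs).
    # 2. simulated MoE: apply the (simulated) expert computation to the tokens
    #    this rank received.
    # 3. combine: send expert outputs back to the originating ranks and reduce
    #    the top-k expert contributions per token.
    raise NotImplementedError
```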