Pull requests: NVIDIA/NeMo

Pull requests list

feat: Compatibility modification of megatron-fsdp
#14593 opened Aug 27, 2025 by shjwudp
8 tasks
Support first last N layers in mxfp8
#14591 opened Aug 27, 2025 by WanZzzzzz
8 tasks
Add gpt-oss lora exporter
#14589 opened Aug 26, 2025 by cuichenx Draft
8 tasks
[Qwen3] MoE 480B A33B model performance recipe
#14580 opened Aug 26, 2025 by gdengk Draft
8 tasks
[Qwen3] Fix flops cal for dense models
#14579 opened Aug 26, 2025 by gdengk
8 tasks
Update ModelCommPGs API from megatron-core
#14578 opened Aug 25, 2025 by yaoyu-33
8 tasks
[Perf script] Llama and GPT3 perf script use mlp cast fusion
#14575 opened Aug 25, 2025 by guyueh1
8 tasks
Making sure best_k_models=-1 doesn't delete models
#14571 opened Aug 25, 2025 by marcromeyn
8 tasks
Enabling DDP Support for LLAMA3 70B LoRa
#14570 opened Aug 25, 2025 by rhmukundan
Bump TE and Mcore
#14568 opened Aug 24, 2025 by chtruong814
8 tasks
AutoTuner in Nemo for Lepton
#14563 opened Aug 22, 2025 by prekshivyas
8 tasks
autoconfigurator enhancements
#14562 opened Aug 22, 2025 by prekshivyas
8 tasks
Add Parakeet Hybrid RNNT CTC BPE Model with Prompt support ASR
#14561 opened Aug 22, 2025 by ealbasiri
5 of 6 tasks
Randomized shard slicing for tarred data common
#14558 opened Aug 22, 2025 by pzelasko
1 of 8 tasks
Add example for loading
#14557 opened Aug 22, 2025 by BoxiangW Draft
8 tasks
Stop overwriting MCore FSDP config in MegatronStrategy
#14556 opened Aug 21, 2025 by rahimftd
1 of 8 tasks