Pull requests: NVIDIA-NeMo/RL
Pull requests list
fix: Fix checkpoint overriding
CI:L1
Run doctests, unit tests, and functional tests
#1255
opened Oct 2, 2025 by
terrykong
4 tasks
feat: add valid_tokens_per_sec metric and total_valid_tokens to save state
CI:L1
Run doctests, unit tests, and functional tests
r0.4.0
#1249
opened Oct 2, 2025 by
terrykong
feat: Using mcore cpu optimizer
CI:L1
Run doctests, unit tests, and functional tests
#1242
opened Oct 1, 2025 by
guyueh1
4 tasks
perf: Add a field in megatron_cfg to enable bias_activation_fusion
CI:L0
Run doctests and unit tests
#1241
opened Oct 1, 2025 by
katec846
4 tasks
docs: add missing async_grpo.enabled flag to configuration
asyncRL
CI:docs
Run doctest
documentation
Improvements or additions to documentation
r0.4.0
#1237
opened Sep 30, 2025 by
youngeunkwon0405
4 tasks
chore: Log the initial training master config
#1232
opened Sep 29, 2025 by
pjin-nvidia
4 tasks
fix: fix github to myst-parser admonition conversion
CI:docs
Run doctest
documentation
Improvements or additions to documentation
#1224
opened Sep 29, 2025 by
terrykong
draft feat: KV cache quantization support in fp8 rollout in GRPO
Low Precision
#1212
opened Sep 26, 2025 by
sharonyu-115
4 tasks
feat: Multi-turn tool calling on BFCLv3 dataset
community-request
documentation
Improvements or additions to documentation
#1207
opened Sep 25, 2025 by
slikhite-1
feat: Compute entropy across full vocab for logging
r0.4.0
#1200
opened Sep 24, 2025 by
parthchadha
4 tasks
ci: Test runner
CI:L1
Run doctests, unit tests, and functional tests
CI
Relating to CI
#1196
opened Sep 23, 2025 by
chtruong814
4 tasks
refactor: unify get_logprobs() and score() logic in dtensor
CI:L1
Run doctests, unit tests, and functional tests
#1173
opened Sep 21, 2025 by
RayenTian
fix: simplified megatron to hf conversion script
r0.4.0
#1169
opened Sep 20, 2025 by
ahmadki
4 tasks
fix: Fix and add new tests for FLOPs accountant
CI:L1
Run doctests, unit tests, and functional tests
r0.4.0
#1149
opened Sep 17, 2025 by
ybgao-nvidia
4 tasks