Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix bug when using dataset streaming by accelerate
#3950 opened Aug 25, 2025 by kaixuanliu Loading…
Update grpo_trainer.py to fix the dim error.
#3943 opened Aug 23, 2025 by HelloWorldLTY Loading…
Return position_ids for flash_attention_3
#3942 opened Aug 23, 2025 by jue-jue-zi Loading…
5 tasks
Docker update
#3931 opened Aug 20, 2025 by qgallouedec Loading…
5 tasks
[SFTTrainer]: Check for assistant mask up to max_length
#3930 opened Aug 20, 2025 by pramodith Loading…
2 of 5 tasks
Support for pre-defined image positions in VLM training data
#3911 opened Aug 17, 2025 by YeFD Loading…
3 of 5 tasks
[DRAFT] Refactor DPO
#3906 opened Aug 15, 2025 by qgallouedec Draft
5 tasks
Test in distributed setting
#3902 opened Aug 15, 2025 by qgallouedec Loading…
5 tasks
BEMA for ref model
#3898 opened Aug 14, 2025 by qgallouedec Loading…
5 tasks
validate examples on xpu
#3897 opened Aug 14, 2025 by yao-matrix Loading…
🧭 HF jobs x TRL guide
#3890 opened Aug 13, 2025 by sergiopaniego Loading…
3 of 12 tasks
Implement DPOP
#3864 opened Aug 7, 2025 by 1485840691 Loading…
Update profiling.py: fix scoping problems for wandb and mlflow
#3845 opened Aug 4, 2025 by markshinyounglee Loading…
5 tasks done
dynamic temperature
#3844 opened Aug 4, 2025 by shirinyamani Draft
5 tasks
[GSPO]: Refactor _compute_loss
#3835 opened Aug 1, 2025 by pramodith Loading…
2 of 5 tasks
support GSPO-token
#3820 opened Jul 31, 2025 by hjh0119 Loading…
Rloo final
#3801 opened Jul 29, 2025 by shirinyamani Loading…
5 tasks
Add vLLM server mode and VLM support to OnlineDPOTrainer
#3783 opened Jul 27, 2025 by vaelev Loading…
6 tasks done
ProTip! What’s not been updated in a month: updated:<2025-07-25.