Pull requests: vllm-project/llm-compressor
Pull requests list
[Multi-modifier] Support scoped application of quantization config/status
#1772
opened Aug 21, 2025 by
brian-dellabetta
•
Draft
3 of 6 tasks
[Tests] Add recovery-based validation to LM-Eval tests
#1750
opened Aug 18, 2025 by
rahul-tuli
•
Draft
2 of 7 tasks
[Tracing] Decouple vision tower from first layer
ready
When a PR is ready for review
#1710
opened Aug 6, 2025 by
kylesayrs
[Autowrapper] Support Gemma3, autowrapper improvements
#1693
opened Jul 30, 2025 by
kylesayrs
[KV Cache] support kv cache int8 per channel quantization
ready
When a PR is ready for review
#1663
opened Jul 19, 2025 by
Eviannn
Minor speedup for infer_quantization_format when save_compressed=False
#1636
opened Jul 10, 2025 by
kylesayrs
[GPTQ] Use torch.compile to speed up gptq algo
ready
When a PR is ready for review
#1561
opened Jun 17, 2025 by
aladerran