Skip to content

Pull requests: vllm-project/llm-compressor

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix wonky dependency range on datasets
#1774 opened Aug 22, 2025 by timkpaine Loading…
[MoE] Llama4 and More tests
#1760 opened Aug 19, 2025 by kylesayrs Draft
[Transform] SpinQuant R4
#1746 opened Aug 18, 2025 by kylesayrs Draft
[bugfix] Fix indentation errors in the README file
#1737 opened Aug 15, 2025 by qibaoyuan Loading…
Enable xpu device
#1736 opened Aug 15, 2025 by jiqing-feng Loading…
[Utils] Offloaded cache size
#1714 opened Aug 7, 2025 by kylesayrs Loading…
[Tracing] Decouple vision tower from first layer ready When a PR is ready for review
#1710 opened Aug 6, 2025 by kylesayrs Loading…
[WIP] [MoE] GPT OSS
#1705 opened Aug 5, 2025 by kylesayrs Draft
[Example] [VLM] Gemma3n
#1696 opened Jul 31, 2025 by kylesayrs Draft
1686 Logic matching refactor
#1687 opened Jul 28, 2025 by ved1beta Loading…
add quantization_w4a4_fp4 qwen3 example
#1681 opened Jul 24, 2025 by wangwenmingaa Loading…
[KV Cache] support kv cache int8 per channel quantization ready When a PR is ready for review
#1663 opened Jul 19, 2025 by Eviannn Loading…
[Transform] Online Rotations
#1651 opened Jul 16, 2025 by kylesayrs Draft
[Pipelines] Add propagate_error argument ready When a PR is ready for review
#1575 opened Jun 20, 2025 by kylesayrs Draft
[GPTQ] Use torch.compile to speed up gptq algo ready When a PR is ready for review
#1561 opened Jun 17, 2025 by aladerran Loading…
Disable sequential_targets from modifiers ready When a PR is ready for review
#1559 opened Jun 16, 2025 by kylesayrs Draft
ProTip! Add no:assignee to see everything that’s not assigned.