-
Notifications
You must be signed in to change notification settings - Fork 306
Pull requests: vllm-project/llm-compressor
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Utils] Deprecate unused utils
ready
When a PR is ready for review
#2097
opened Dec 5, 2025 by
kylesayrs
Loading…
Remove replace_module_for_calibration
ready
When a PR is ready for review
#2095
opened Dec 4, 2025 by
dsikka
Loading…
Linearize gpt_oss model and add separate example to qunatize it to w4a8
#2091
opened Dec 3, 2025 by
isharif168
Loading…
feat: add importance-aware mixed-precision quantization
#2083
opened Dec 2, 2025 by
wangwenmingaa
Loading…
[AWQ] use match_modules_set and fix logic
awq
For any issue / PR related to AWQ support
ready
When a PR is ready for review
#2070
opened Nov 25, 2025 by
HDCharles
Loading…
[Performance] Batched calibration
ready
When a PR is ready for review
#2054
opened Nov 20, 2025 by
kylesayrs
Loading…
[Misc] Remove When a PR is ready for review
is_moe_model
ready
#2053
opened Nov 20, 2025 by
kylesayrs
Loading…
Modernize transformers module with type hints and generic types
#2034
opened Nov 14, 2025 by
sugatmahanti
Loading…
[Sequential Onloading] Support onloading and offloading frozen dataclasses
#2016
opened Nov 10, 2025 by
kylesayrs
Loading…
Implement When a PR is ready for review
propagate_error argument
ready
#2008
opened Nov 10, 2025 by
kylesayrs
Loading…
[When a PR is ready for review
model_free_ptq] NVFP4A16
ready
#1988
opened Nov 3, 2025 by
kylesayrs
Loading…
[AWQ] Allow users to disable quantization during AWQ
#1973
opened Oct 28, 2025 by
brian-dellabetta
•
Draft
[Oneshot] Add validation for empty dataset and enhance oneshot function parameters
#1957
opened Oct 21, 2025 by
ArkaSanka
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.