Pull requests: NVIDIA/Model-Optimizer

Pull requests list

Add LTX-2 third-party license notices for legal compliance
#1226 opened Apr 9, 2026 by kevalmorabia97 Collaborator
3 tasks
added gptq nvfp4 default recipe + docstring fix
#1224 opened Apr 9, 2026 by sugunav14 Contributor
1 task done
GPTQ vector
#1223 opened Apr 9, 2026 by sugunav14 Contributor Draft
fix decoder_layer_cls failure on trust_remote_code models
#1222 opened Apr 9, 2026 by j-rausch Contributor
Add Gemma4 MoE quantization support
#1219 opened Apr 9, 2026 by yueshen2016 Contributor
4 tasks done
Add WaterSIC for KV-cache quantization
#1217 opened Apr 9, 2026 by kaix-nv Contributor Draft
Add TriAttention For KV Cache Compression
#1216 opened Apr 9, 2026 by kaix-nv Contributor Draft
Add VLM base model support for auto_quantize in hf_ptq
#1214 opened Apr 9, 2026 by yueshen2016 Contributor
Add FP8 QKVO + NVFP4 MLP PTQ recipe
#1213 opened Apr 9, 2026 by yueshen2016 Contributor
add: DFlash block diffusion speculative decoding
#1211 opened Apr 8, 2026 by ChenhanYu Collaborator
Replace in-repo LLM ONNX export with TensorRT-Edge-LLM
#1210 opened Apr 8, 2026 by ajrasane Contributor
Upgrade ONNX from 1.19 to 1.21
#1207 opened Apr 8, 2026 by ajrasane Contributor
[1/N] Polish PTQ skills
#1198 opened Apr 8, 2026 by Edwardf0t1 Contributor
fix: handle accelerate CPU-offloaded models in FakeQuant export
#1194 opened Apr 8, 2026 by sungsooha Contributor
Validate non-empty cfg when enabling quantizers in quant_cfg
#1192 opened Apr 7, 2026 by shengliangxu Collaborator
Simplify KDTrainer and enhance ModelOptHFTrainer
#1191 opened Apr 7, 2026 by realAsma Contributor
4 of 6 tasks
Generic Fused MoE Quantization + Export for transformers 5.0+
#1187 opened Apr 7, 2026 by Edwardf0t1 Contributor
2 of 3 tasks
GPTQ test
#1179 opened Apr 6, 2026 by sugunav14 Contributor Draft
[1/N] Refactor llm_qat example: YAML configs + ModelOptArgParser
#1172 opened Apr 2, 2026 by realAsma Contributor
3 of 4 tasks