-
Notifications
You must be signed in to change notification settings - Fork 340
Pull requests: NVIDIA/Model-Optimizer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add LTX-2 third-party license notices for legal compliance
#1226
opened Apr 9, 2026 by
kevalmorabia97
Collaborator
Loading…
3 tasks
added gptq nvfp4 default recipe + docstring fix
#1224
opened Apr 9, 2026 by
sugunav14
Contributor
Loading…
1 task done
fix decoder_layer_cls failure on trust_remote_code models
#1222
opened Apr 9, 2026 by
j-rausch
Contributor
Loading…
consolidate mbridge distillation: merge distill_hf.py into distill.py
#1220
opened Apr 9, 2026 by
j-rausch
Contributor
Loading…
Add Gemma4 MoE quantization support
#1219
opened Apr 9, 2026 by
yueshen2016
Contributor
Loading…
4 tasks done
Add VLM base model support for auto_quantize in hf_ptq
#1214
opened Apr 9, 2026 by
yueshen2016
Contributor
Loading…
add: DFlash block diffusion speculative decoding
#1211
opened Apr 8, 2026 by
ChenhanYu
Collaborator
Loading…
Replace in-repo LLM ONNX export with TensorRT-Edge-LLM
#1210
opened Apr 8, 2026 by
ajrasane
Contributor
Loading…
Add Z-Image (NextDiT/Lumina2) PTQ quantization support in diffusers example
#1205
opened Apr 8, 2026 by
andrea-pilzer
Loading…
Add support for postprocess exported model for block scale swizzling and support for different padding strategy
#1195
opened Apr 8, 2026 by
ynankani
Contributor
Loading…
fix: handle accelerate CPU-offloaded models in FakeQuant export
#1194
opened Apr 8, 2026 by
sungsooha
Contributor
Loading…
Validate non-empty cfg when enabling quantizers in quant_cfg
#1192
opened Apr 7, 2026 by
shengliangxu
Collaborator
Loading…
Simplify KDTrainer and enhance ModelOptHFTrainer
#1191
opened Apr 7, 2026 by
realAsma
Contributor
Loading…
4 of 6 tasks
Add ModelOpt Triton attention kernels for WAN2.2 diffusion (sparse, skip-softmax, NVFP4)
#1190
opened Apr 7, 2026 by
yeyu-nvidia
Contributor
Loading…
5 tasks
Generic Fused MoE Quantization + Export for transformers 5.0+
#1187
opened Apr 7, 2026 by
Edwardf0t1
Contributor
Loading…
2 of 3 tasks
[chore]: weekly bump of uv.lock on main (2026-04-06)
#1180
opened Apr 6, 2026 by
github-actions
bot
Loading…
feat: parallelize fakequant export across GPUs via ThreadPoolExecutor
#1177
opened Apr 3, 2026 by
sungsooha
Contributor
Loading…
[1/N] Refactor llm_qat example: YAML configs + ModelOptArgParser
#1172
opened Apr 2, 2026 by
realAsma
Contributor
Loading…
3 of 4 tasks
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.