-
Notifications
You must be signed in to change notification settings - Fork 177
Pull requests: NVIDIA/TensorRT-Model-Optimizer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Bump trtllm to 1.2.0rc0.post1 and pytorch to 25.08 for cuda 13
#445
opened Oct 17, 2025 by
kevalmorabia97
Loading…
1 task
Add SD3.5-medium quantization support in ModelOpt Diffusers example
#444
opened Oct 17, 2025 by
vishalpandya1990
Loading…
[Autocast] Add low precision autocasting support for Resize op
#436
opened Oct 14, 2025 by
aboubezari
Loading…
Cleanup mixed precision and gather node layer info mapping
#434
opened Oct 14, 2025 by
ynankani
Loading…
Add example for multinode calibration using FSDP2
#432
opened Oct 13, 2025 by
sugunav14
Loading…
2 of 5 tasks
Fix megatron distributed checkpoint metadata pass through
#431
opened Oct 13, 2025 by
ChenhanYu
Loading…
Ensure that the ONNX IR version is the max supported version (10)
#416
opened Oct 9, 2025 by
gcunhase
Loading…
Update onnx ptq test to be single threaded and make it faster
#415
opened Oct 8, 2025 by
ajrasane
Loading…
[4975376][5541172]perplexity and kl-divergence benchmark metrics
#411
opened Oct 8, 2025 by
ynankani
Loading…
Added support for quantizing TEGroupedMLP for megatron-lm
#403
opened Oct 7, 2025 by
kinjalpatel27
Loading…
EAGLE parallel draft with auto regression; kv cache in EAGLE training
#391
opened Sep 29, 2025 by
yeyu-nvidia
Loading…
[5545101]: AutoCast: Add options to force include node/op in F16
#386
opened Sep 28, 2025 by
galagam
Loading…
Support kv cache quantization for mcore using bmm_quantizers
#375
opened Sep 25, 2025 by
kaix-nv
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-09-17.