Skip to content

Pull requests: NVIDIA/TensorRT-Model-Optimizer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix ONNX FP8 scaling
#446 opened Oct 17, 2025 by Darth-Kronos Loading…
[OMNIML-2857] Support the DeepSeek V3.2 model
#435 opened Oct 14, 2025 by cjluo-nv Loading…
Add example for multinode calibration using FSDP2
#432 opened Oct 13, 2025 by sugunav14 Loading…
2 of 5 tasks
Yeyu/debug paralllel draft
#429 opened Oct 13, 2025 by yeyu-nvidia Loading…
[New feature] Add Support For Sparse Attention
#408 opened Oct 7, 2025 by kaix-nv Loading…
Explicitly register real quant gemms
#402 opened Oct 6, 2025 by cjluo-nv Loading…
megatron realquant FP8 WIP
#367 opened Sep 24, 2025 by cjluo-nv Draft
ProTip! What’s not been updated in a month: updated:<2025-09-17.