Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[None][chore] Cleanup KV cache manager
#8396 opened Oct 15, 2025 by Funatiq Draft
1 task
[None][fix] improve mpirun hang issues
#8385 opened Oct 15, 2025 by xinhe-nv Draft
1 task
Ampere xqa swa specdec
#8383 opened Oct 15, 2025 by jhaotingc Loading…
1 task
[TRTLLM-8464][infra] Use public triton 3.5.0
#8382 opened Oct 15, 2025 by ZhanruiSunCh Loading…
1 task
[None][fix] fix error when processing batches containing both text and mm data Community want to contribute PRs initiated from Community
#8381 opened Oct 15, 2025 by Nekofish-L Loading…
1 task done
[None][chore] update test duration
#8377 opened Oct 15, 2025 by xinhe-nv Loading…
1 task done
[None][fix] Fix request_id for best_of/n case
#8368 opened Oct 14, 2025 by evezhier Loading…
1 task done
[TRTLLM-6741] [feat] Extend lm_head_tp_size to world_size
#8367 opened Oct 14, 2025 by kaiyux Loading…
1 task
[None][feat] Add max_total_draft_tokens
#8366 opened Oct 14, 2025 by yweng0828 Loading…
1 task done
[https://nvbugs/5542862][fix] Upgrade fmha_v2.
#8364 opened Oct 14, 2025 by yuxianq Loading…
1 task done
[https://nvbugs/5461761][fix] Unwaive eagle3 test
#8363 opened Oct 14, 2025 by sunnygqq Loading…
1 task done
[None][feat] Dev DeepConf
#8362 opened Oct 14, 2025 by dcaox Loading…
1 task done
ProTip! Follow long discussions with comments:>50.