-
Notifications
You must be signed in to change notification settings - Fork 364
Insights: pytorch/TensorRT
Overview
Could not load contribution data
Please try again later
49 Pull requests merged by 9 people
-
slice scatter support for dynamic cases
#3513 merged
Jul 7, 2025 -
fix compiler cl not found error in windows
#3660 merged
Jul 7, 2025 -
Lluo/cherry pick 3620
#3658 merged
Jul 4, 2025 -
Refit debug patch
#3620 merged
Jul 4, 2025 -
fix: replace add_identity by add_cast for type cast
#3563 merged
Jul 4, 2025 -
feat: Add bf16 support to cast converter
#3643 merged
Jul 4, 2025 -
adding rotary embedding example, with graph rewrite for complex subgraph
#3570 merged
Jul 3, 2025 -
Lluo/cherry pick 3629
#3656 merged
Jul 3, 2025 -
cherry pick 3642
#3655 merged
Jul 3, 2025 -
feat: Refactor LLM model zoo and add KV cache support
#3527 merged
Jul 3, 2025 -
Closed the perf gap of resnet and enabled refit
#3629 merged
Jul 2, 2025 -
Fix dynamo core test failure on Windows
#3642 merged
Jul 2, 2025 -
cherry pick 3636
#3640 merged
Jun 30, 2025 -
release 2.8 branch cut
#3638 merged
Jun 30, 2025 -
Temporary fix to workaround the mutable decomposition error.
#3636 merged
Jun 30, 2025 -
upgrade torch_tensorrt version from 2.8.0.dev to 2.9.0.dev
#3639 merged
Jun 30, 2025 -
fix the broken CC0 image link
#3635 merged
Jun 30, 2025 -
disable aoti format in windows
#3632 merged
Jun 28, 2025 -
disable test on aarch64 for now
#3623 merged
Jun 27, 2025 -
fix ts fe test error.
#3619 merged
Jun 26, 2025 -
fix: Fix unbacked sym int not found issue
#3617 merged
Jun 26, 2025 -
remove debug flag
#3618 merged
Jun 25, 2025 -
a few CI changes
#3612 merged
Jun 25, 2025 -
TorchTensorRTModule Serialization Fix
#3572 merged
Jun 24, 2025 -
fix cross compilation test bug
#3609 merged
Jun 24, 2025 -
py39 does not like | E TypeError: unsupported operand type(s) for |: 'type' and 'EnumMeta'
#3611 merged
Jun 24, 2025 -
fix: pre-commit issues
#3603 merged
Jun 19, 2025 -
fix: add prefix in hierarchical_partitioner_example
#3607 merged
Jun 18, 2025 -
debug flag is deprecated, remove it so that test won't complain
#3610 merged
Jun 18, 2025 -
fix enabled_precisions error in test cases
#3606 merged
Jun 18, 2025 -
skip flashinfer-python for py3.9 due to upstream error
#3605 merged
Jun 18, 2025 -
feat: Saving modules using the AOTI format
#3567 merged
Jun 18, 2025 -
fix dynamo converter test case failure
#3594 merged
Jun 18, 2025 -
feat: Hierarchical Partitioner to support multi-backends
#3539 merged
Jun 18, 2025 -
Import dllist only on linux
#3592 merged
Jun 17, 2025 -
test failed but displayed as green
#3599 merged
Jun 17, 2025 -
Changed weight map to tensor and fix the refit bug
#3573 merged
Jun 17, 2025 -
remove torchvision dependency from build, optional for test
#3598 merged
Jun 17, 2025 -
feat: working uv pyproject.toml
#3597 merged
Jun 17, 2025 -
Fix: fix aot plugin example docstring issue
#3595 merged
Jun 17, 2025 -
Verify C++ tests, fix cuda graphs union issue
#3589 merged
Jun 16, 2025 -
fix main test failure bug
#3590 merged
Jun 16, 2025 -
FX graph visualization
#3528 merged
Jun 13, 2025 -
Added flux demo
#3418 merged
Jun 13, 2025 -
fix: Fix a perf regression due to weights being ITensors
#3568 merged
Jun 13, 2025 -
Add fp4 support
#3532 merged
Jun 13, 2025 -
add --no-deps for tests/py/requirements.txt
#3569 merged
Jun 12, 2025 -
fix: Fix constant folding failure due to modelopt
#3565 merged
Jun 11, 2025 -
Cross compile guard
#3486 merged
Jun 11, 2025
19 Pull requests opened by 8 people
-
Cpu memory optimization
#3602 opened
Jun 17, 2025 -
Remove Bazel files from wheel
#3615 opened
Jun 25, 2025 -
add weekly cron job to upgrade uv lock
#3616 opened
Jun 25, 2025 -
Removal of BAZEL build files from python package and changes to make cpp tests work
#3641 opened
Jul 1, 2025 -
Upgrade perf_run script to support TRT 10 and fix some issues
#3650 opened
Jul 2, 2025 -
[kv_cache] integrated vlm code for benchmark (Stacked on #3527)
#3652 opened
Jul 3, 2025 -
chore: Fix path in FLUX demo
#3654 opened
Jul 3, 2025 -
Add Flux fp4 support
#3657 opened
Jul 4, 2025 -
cherry pick 3660 to release/2.8
#3661 opened
Jul 7, 2025 -
Cherrypick of PR 3570
#3662 opened
Jul 7, 2025 -
fix the int8 quantization failure error
#3663 opened
Jul 7, 2025 -
Cherrypick of PR 3513
#3664 opened
Jul 7, 2025 -
cherry pick 3663: fix the int8 quantization error, remove duplicated lines
#3665 opened
Jul 7, 2025 -
chore: cherry pick of bf16 cast PR (3643)
#3666 opened
Jul 7, 2025 -
chore: cherry pick of KV cache PR (3527)
#3667 opened
Jul 8, 2025 -
chore(deps): bump transformers from 4.49.0 to 4.51.0 in /tests/py
#3668 opened
Jul 8, 2025 -
chore(deps): bump transformers from 4.50.0 to 4.51.0 in /examples/dynamo
#3669 opened
Jul 8, 2025 -
chore(deps): bump transformers from 4.48.0 to 4.52.1 in /tests/modules
#3670 opened
Jul 8, 2025 -
Lluo/lan fix torch compile gpt2
#3671 opened
Jul 8, 2025
27 Issues closed by 7 people
-
🐛 [Bug] `require_full_compilation` never reaches partitioner
#3171 closed
Jul 8, 2025 -
❓ [Question] Why is `torch.bfloat16` excluded from the `allowed_casts` set ?
#3637 closed
Jul 4, 2025 -
🐛 [Bug] current BN implementation results in slower performance
#3628 closed
Jul 2, 2025 -
🐛 [Bug] Windows dynamo core tests failed
#3631 closed
Jul 2, 2025 -
🐛 [Bug] .tests.py.ts.api.test_classes
#3576 closed
Jul 1, 2025 -
🐛 [Bug] tests.py.dynamo.models.test_dtype_support.TestBF16Support.test_bf16_cpp
#3579 closed
Jul 1, 2025 -
🐛 [Bug] tests.py.dynamo.models.test_dyn_models.test_base_dynamic
#3580 closed
Jul 1, 2025 -
🐛 [Bug] MutableModule Test Failures
#3586 closed
Jul 1, 2025 -
🐛 [Bug] Dynamo core test failed in linux
#3633 closed
Jul 1, 2025 -
🐛 [Bug] Windows dynamo export serde test failed
#3630 closed
Jun 28, 2025 -
🐛 [Bug] Test failure `tests.py.ts.api.test_classes`
#3578 closed
Jun 28, 2025 -
🐛 [Bug] Reexport test failures
#3588 closed
Jun 26, 2025 -
🐛 [Bug] Using `torch_executed_ops` causes the metadata packing to fail
#3566 closed
Jun 24, 2025 -
🐛 [Bug] Refit test failures
#3587 closed
Jun 19, 2025 -
🐛 [Bug] Workflow issue: the tests are failed but showing as green in the workflow UI
#3593 closed
Jun 19, 2025 -
🐛 [Bug] mypy precommit issues
#3600 closed
Jun 19, 2025 -
🐛 [Bug] ModuleNotFoundError: No module named 'dllist'
#3591 closed
Jun 17, 2025 -
❓ [Question] Can I export a serialized engine from Torch-TensorRT targeting TensorRT 10.3.0.26?
#3571 closed
Jun 16, 2025 -
🐛 [Bug] Severe performance regression with 2.7 and 2.8.dev
#3561 closed
Jun 13, 2025 -
🐛 [Bug] AttributeError: '_OpNamespace' 'tensorrt' object has no attribute 'quantize_op'
#3560 closed
Jun 13, 2025 -
Constant folding crashes if model opt is not installed
#3564 closed
Jun 11, 2025
19 Issues opened by 10 people
-
🐛 [Bug] dynamo.compile fails in 2.9.0 nightly when mqt quantization is used
#3659 opened
Jul 7, 2025 -
🐛 [Bug] examples/dynamo/torch_compile_gpt2.py run failed
#3653 opened
Jul 3, 2025 -
🐛 [Bug] FLUX Accuracy Issue
#3651 opened
Jul 2, 2025 -
🐛 [Bug] Draw svg for flux model hangs
#3649 opened
Jul 1, 2025 -
🐛 [Bug] group_norm implementation may slows down SD vae
#3648 opened
Jul 1, 2025 -
🐛 [Bug] index and index_put don't support dynamic shapes
#3647 opened
Jul 1, 2025 -
🐛 [Bug] Flux does not produce the correct image
#3646 opened
Jul 1, 2025 -
🐛 [Bug] with torch_tensorrt.dynamo.Debugger is not working with MutableTorchTensorRTModule
#3644 opened
Jul 1, 2025 -
🐛 [Bug] Changing input size would affect the TRT engine size, testing on BERT
#3634 opened
Jun 28, 2025 -
📖 [Story] Performance improvements/gaps in Torch-TensorRT
#3627 opened
Jun 26, 2025 -
🐛 [Bug] Large Accuracy Issue
#3626 opened
Jun 26, 2025 -
✨[Feature] Integrate release pipeline with test pipeline
#3625 opened
Jun 26, 2025 -
✨[Feature] Automate examples verification in the release pipeline
#3624 opened
Jun 26, 2025 -
compile the QAT trained model in with TensorRT
#3622 opened
Jun 26, 2025 -
error in data types
#3614 opened
Jun 25, 2025 -
🐛 [Bug] Flux perf scripts issue
#3601 opened
Jun 17, 2025 -
🐛 [Bug] torchtrt.dynamo.compile produces nan values
#3596 opened
Jun 16, 2025 -
🐛 [Bug] Weight streaming test failures
#3585 opened
Jun 14, 2025
7 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
TRT-LLM loading mechanism tool
#3398 commented on
Jul 7, 2025 • 17 new comments -
🐛 [Bug] AssertionError: end must be an integer
#3448 commented on
Jun 12, 2025 • 0 new comments -
Is it possible to enable Github Merge Queues for our Repos?
#3559 commented on
Jun 17, 2025 • 0 new comments -
❓ [Question] Is SAM2 supported when compiling with the Dynamo backend on JetPack 6.1 or 6.2?
#3478 commented on
Jun 28, 2025 • 0 new comments -
🐛 [Bug] Encountered bug when using Torch-TensorRT
#3477 commented on
Jun 30, 2025 • 0 new comments -
Debugging guide for TensorRT
#3489 commented on
Jun 9, 2025 • 0 new comments -
AOT plugin: examples with RMSNORM
#3529 commented on
Jun 12, 2025 • 0 new comments