[Bugfix] LLama4: fix for llama4 multimodal support #5809

chang-l · 2025-07-07T22:19:48Z

Llama4 multimodal support

Description

Small fix for the following error from

TensorRT-LLM/tensorrt_llm/_torch/models/modeling_llama.py

Line 669 in a1235ee

    
           "You cannot specify both input_ids and inputs_embeds at the same time, and must specify either one"

Test Coverage

python3 quickstart_multimodal.py --model_dir meta-llama/Llama-4-Scout-17B-16E-Instruct --modality image --tp_size 8 --max_num_tokens 4096

GitHub Bot Help

/bot [-h] ['run', 'kill', 'skip', 'reuse-pipeline'] ...

Provide a user friendly way for developers to interact with a Jenkins server.

Run /bot [-h|--help] to print this help message.

See details below for each supported subcommand.

run [--disable-fail-fast --skip-test --stage-list "A10-1, xxx" --gpu-type "A30, H100_PCIe" --add-multi-gpu-test --only-multi-gpu-test --disable-multi-gpu-test --post-merge --extra-stage "H100_PCIe-[Post-Merge]-1, xxx"]

Launch build/test pipelines. All previously running jobs will be killed.

--disable-fail-fast (OPTIONAL) : Disable fail fast on build/tests/infra failures.

--skip-test (OPTIONAL) : Skip all test stages, but still run build stages, package stages and sanity check stages. Note: Does NOT update GitHub check status.

--stage-list "A10-1, xxx" (OPTIONAL) : Only run the specified test stages. Examples: "A10-1, xxx". Note: Does NOT update GitHub check status.

--gpu-type "A30, H100_PCIe" (OPTIONAL) : Only run the test stages on the specified GPU types. Examples: "A30, H100_PCIe". Note: Does NOT update GitHub check status.

--only-multi-gpu-test (OPTIONAL) : Only run the multi-GPU tests. Note: Does NOT update GitHub check status.

--disable-multi-gpu-test (OPTIONAL) : Disable the multi-GPU tests. Note: Does NOT update GitHub check status.

--add-multi-gpu-test (OPTIONAL) : Force run the multi-GPU tests. Will also run L0 pre-merge pipeline.

--post-merge (OPTIONAL) : Run the L0 post-merge pipeline instead of the ordinary L0 pre-merge pipeline.

--extra-stage "H100_PCIe-[Post-Merge]-1, xxx" (OPTIONAL) : Run the ordinary L0 pre-merge pipeline and specified test stages. Examples: --extra-stage "H100_PCIe-[Post-Merge]-1, xxx".

For guidance on mapping tests to stage names, see docs/source/reference/ci-overview.md.

kill

kill

Kill all running builds associated with pull request.

skip

skip --comment COMMENT

Skip testing for latest commit on pull request. --comment "Reason for skipping build/test" is required. IMPORTANT NOTE: This is dangerous since lack of user care and validation can cause top of tree to break.

reuse-pipeline

reuse-pipeline

Reuse a previous pipeline to validate current commit. This action will also kill all currently running builds associated with the pull request. IMPORTANT NOTE: This is dangerous since lack of user care and validation can cause top of tree to break.

rakib-hasan

LGTM

chang-l · 2025-07-07T22:27:58Z

/bot run --disable-fail-fast

tensorrt-cicd · 2025-07-07T22:33:37Z

PR_Github #11185 [ run ] triggered by Bot

tensorrt-cicd · 2025-07-08T00:25:22Z

PR_Github #11185 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #8274 completed with status: 'FAILURE'

chang-l · 2025-07-08T00:41:37Z

/bot run --disable-fail-fast

Signed-off-by: Chang Liu <[email protected]>

chang-l · 2025-07-08T04:40:47Z

/bot run

tensorrt-cicd · 2025-07-08T04:46:23Z

PR_Github #11208 [ run ] triggered by Bot

tensorrt-cicd · 2025-07-08T05:09:13Z

PR_Github #11208 [ run ] completed with state FAILURE
/LLM/main/L0_MergeRequest_PR pipeline #8292 completed with status: 'FAILURE'

chang-l · 2025-07-08T16:30:26Z

/bot run

tensorrt-cicd · 2025-07-08T16:35:43Z

PR_Github #11333 [ run ] triggered by Bot

tensorrt-cicd · 2025-07-08T21:41:00Z

PR_Github #11333 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #8384 completed with status: 'FAILURE'

chang-l · 2025-07-08T22:15:02Z

/bot run

tensorrt-cicd · 2025-07-08T22:20:35Z

PR_Github #11347 [ run ] triggered by Bot

tensorrt-cicd · 2025-07-09T04:03:24Z

PR_Github #11347 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #8398 completed with status: 'SUCCESS'
Pipeline passed with automatic retried tests. Check the rerun report for details.

Signed-off-by: Yuxin <[email protected]>

fix llama4-vl

eff24ab

chang-l requested review from milesial and symphonylyh July 7, 2025 22:20

chang-l marked this pull request as ready for review July 7, 2025 22:21

chang-l requested a review from a team as a code owner July 7, 2025 22:21

chang-l requested a review from juney-nvidia July 7, 2025 22:21

rakib-hasan approved these changes Jul 7, 2025

View reviewed changes

symphonylyh approved these changes Jul 7, 2025

View reviewed changes

chang-l enabled auto-merge (squash) July 7, 2025 22:44

yechank-nvidia approved these changes Jul 8, 2025

View reviewed changes

Merge branch 'main' into fix-llama4-vl

6bd61b0

Signed-off-by: Chang Liu <[email protected]>

formatting

67d86b9

Merge branch 'main' into fix-llama4-vl

545c11e

chang-l merged commit 4df5f96 into NVIDIA:main Jul 9, 2025
3 checks passed

zhou-yuxin pushed a commit to zhou-yuxin/TensorRT-LLM that referenced this pull request Jul 15, 2025

[Bugfix] LLama4: fix for llama4 multimodal support (NVIDIA#5809)

a676bda

Signed-off-by: Yuxin <[email protected]>

[Bugfix] LLama4: fix for llama4 multimodal support #5809

[Bugfix] LLama4: fix for llama4 multimodal support #5809

Uh oh!

Conversation

chang-l commented Jul 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Llama4 multimodal support

Description

Test Coverage

GitHub Bot Help

kill

skip

reuse-pipeline

Uh oh!

rakib-hasan left a comment

Choose a reason for hiding this comment

Uh oh!

chang-l commented Jul 7, 2025

Uh oh!

tensorrt-cicd commented Jul 7, 2025

Uh oh!

tensorrt-cicd commented Jul 8, 2025

Uh oh!

chang-l commented Jul 8, 2025

Uh oh!

chang-l commented Jul 8, 2025

Uh oh!

tensorrt-cicd commented Jul 8, 2025

Uh oh!

tensorrt-cicd commented Jul 8, 2025

Uh oh!

chang-l commented Jul 8, 2025

Uh oh!

tensorrt-cicd commented Jul 8, 2025

Uh oh!

tensorrt-cicd commented Jul 8, 2025

Uh oh!

chang-l commented Jul 8, 2025

Uh oh!

tensorrt-cicd commented Jul 8, 2025

Uh oh!

tensorrt-cicd commented Jul 9, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

chang-l commented Jul 7, 2025 •

edited

Loading