[ROCm] Effort to reduce the number of environment variables in command line #17229
Conversation
Signed-off-by: Hongxia Yang <[email protected]>
Can we add a test plan?
```diff
@@ -114,6 +114,12 @@ COPY --from=export_vllm /examples ${COMMON_WORKDIR}/vllm/examples
 ENV RAY_EXPERIMENTAL_NOSET_ROCR_VISIBLE_DEVICES=1
 ENV TOKENIZERS_PARALLELISM=false

+# ENV that can improve safe tensor loading, and end-to-end time
+ENV SAFETENSORS_FAST_GPU=1
```
> ENV that can improve safe tensor loading, and end-to-end time

I didn't find this variable in the vLLM repository. Could you remind me why it can improve loading time?
This is for safetensors; see https://huggingface.co/docs/safetensors/en/speed for more details.
docker/Dockerfile.rocm (outdated)

```diff
+# ENV that can improve safe tensor loading, and end-to-end time
+ENV SAFETENSORS_FAST_GPU=1
+# ENV that needed for multi-process on cuda-like platform
+ENV VLLM_WORKER_MULTIPROC_METHOD=spawn
```
Can we add a comment to elaborate on why spawn is needed here? Is it due to some compatibility issue?
> Can we add comment to elaborate why spawn is needed here? Is it due to some compatibility issue?
There was a time when this was broken on ROCm, and the fix was to force the spawn method on the ROCm platform. See the issue below:
#7791
However, during Llama 4 enablement in March, I found the issue had come back when running simple scripts, and we have been setting this env variable in our scripts since then.
So, to make things safe and stable in all situations, I think it is user-friendly to put it in the Dockerfile.
Right now, the default is set to "fork" in envs.py:

```python
# Use dedicated multiprocess context for workers.
# Both spawn and fork work
"VLLM_WORKER_MULTIPROC_METHOD":
    lambda: os.getenv("VLLM_WORKER_MULTIPROC_METHOD", "fork"),
```
But if we search the code, there are many places that force `VLLM_WORKER_MULTIPROC_METHOD=spawn`, for example:
```python
if reason is not None:
    logger.warning(
        "We must use the `spawn` multiprocessing start method. "
        "Overriding VLLM_WORKER_MULTIPROC_METHOD to 'spawn'. "
        "See https://docs.vllm.ai/en/latest/getting_started/"
        "troubleshooting.html#python-multiprocessing "
        "for more information. Reason: %s", reason)
    os.environ["VLLM_WORKER_MULTIPROC_METHOD"] = "spawn"
```
Added the test plan in the description of the pull request.
Signed-off-by: Hongxia Yang <[email protected]>
Signed-off-by: Hongxia Yang <[email protected]>
This sets two environment variables in the Dockerfile so that users do not have to pass them on the command line when running scripts:
- an ENV that can improve safetensors loading, and end-to-end time
- an ENV that is needed for multi-processing on CUDA-like platforms
Test: