Enable parallel instance loading backend attribute #284
As far as I can tell, the Python backend is completely thread-safe on calls to TRITONBACKEND_ModelInstanceInitialize.
All use of model_state (the problem area for instance initialization in current backends) appears to be read-only. It is simply passed to the ModelInstanceState constructor.
L0_backend_python has been hanging on the BLS model load tests, but this is now happening on main as well, so I don't believe it is due to any of the parallel instance changes. Corresponding tests: triton-inference-server/server#6126