Skip to content

Conversation

@pskiran1
Copy link
Member

For more info, please refer to triton-inference-server/core#455

@pskiran1 pskiran1 changed the title feat: Add support for max_inflight_responses parameter to prevent unbounded memory growth in ensemble models feat: Add support for max_inflight_responses parameter to prevent unbounded memory growth in ensemble models Oct 17, 2025
@pskiran1 pskiran1 requested a review from yinggeh October 23, 2025 08:33
whoisj
whoisj previously approved these changes Oct 24, 2025
@pskiran1 pskiran1 changed the title feat: Add support for max_inflight_responses parameter to prevent unbounded memory growth in ensemble models feat: Add support for max_inflight_requests parameter to prevent unbounded memory growth in ensemble models Oct 24, 2025
@pskiran1 pskiran1 requested a review from yinggeh October 27, 2025 08:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Development

Successfully merging this pull request may close these issues.

4 participants