[TRTLLM-8269][fix] Revert "do not explicitly pass temperature=0 to select greedy sampling" #8103
…elect gr…" This reverts commit 31a1a5f. Signed-off-by: ixlmar <[email protected]>
097f8ae to 74e95ca
Walkthrough
Introduces explicit temperature=0 in evaluation and test code paths: adds "temperature": 0 to the sampling arguments in json_mode_eval.py, updates generate_samples in mmlu.py to yield {"temperature": 0} instead of None, and sets temperature=0 in a specific OpenAI chat completion test.
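To illustrate why passing temperature=0 explicitly matters, here is a minimal, self-contained sketch. It is not the TensorRT-LLM implementation: `pick_token` and the shape of what `generate_samples` yields are hypothetical, chosen only to show a sampler that treats temperature == 0 as greedy (argmax) selection and falls back to stochastic sampling when no temperature is given.

```python
import math
import random
from typing import Iterable, Iterator, Optional, Sequence, Tuple


def pick_token(logits: Sequence[float],
               temperature: Optional[float] = None,
               rng: Optional[random.Random] = None) -> int:
    """Hypothetical sampler, for illustration only.

    temperature == 0 selects the argmax (greedy); None falls back to an
    assumed default of 1.0 and samples from the softmax distribution.
    """
    if temperature == 0:
        # Greedy: deterministic, always the highest-logit token.
        return max(range(len(logits)), key=lambda i: logits[i])
    t = 1.0 if temperature is None else temperature
    rng = rng or random.Random()
    peak = max(logits)  # subtract the max for numerical stability
    weights = [math.exp((x - peak) / t) for x in logits]
    return rng.choices(range(len(logits)), weights=weights, k=1)[0]


def generate_samples(prompts: Iterable[str]) -> Iterator[Tuple[str, dict]]:
    """Hypothetical stand-in for an evaluator's sample generator.

    Yields explicit sampling arguments rather than None, so greedy
    decoding does not depend on the sampler's default behaviour.
    """
    for prompt in prompts:
        yield prompt, {"temperature": 0}


if __name__ == "__main__":
    logits = [0.1, 2.0, -1.0]
    for prompt, sampling_args in generate_samples(["Q1", "Q2"]):
        print(prompt, pick_token(logits, **sampling_args))
```

With the explicit argument, every call returns the same token for the same logits, which is what a reproducible evaluation or test needs; with None, the result in this sketch depends on the random draw.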
Estimated code review effort: 2 (Simple) | ~10 minutes
Pre-merge checks: 2 failed (warnings), 1 passed
Code graph analysis (2):
tensorrt_llm/evaluate/json_mode_eval.py (1)
tests/unittest/llmapi/apps/_test_openai_misc.py (1)
/bot run --disable-fail-fast
PR_Github #20402 [ run ] triggered by Bot
PR_Github #20402 [ run ] completed with state
/bot run --only-multi-gpu-test --disable-fail-fast
…lect greedy sampling" (NVIDIA#8103) Signed-off-by: ixlmar <[email protected]> Signed-off-by: Faradawn Yang <[email protected]>
…lect greedy sampling" (NVIDIA#8103) Signed-off-by: ixlmar <[email protected]>
Reverts #7909
Summary by CodeRabbit
Bug Fixes
Tests