ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Su, Hongjin; Diao, Shizhe; Lu, Ximing; Liu, Mingjie; Xu, Jiacheng; Dong, Xin; Fu, Yonggan; Belcak, Peter; Ye, Hanrong; Yin, Hongxu; Dong, Yi; Bakhturina, Evelina; Yu, Tao; Choi, Yejin; Kautz, Jan; Molchanov, Pavlo

arXiv:2511.21689 (cs)

[Submitted on 26 Nov 2025]

Abstract: Large language models are powerful generalists, yet solving deep and complex problems such as those of the Humanity's Last Exam (HLE) remains both conceptually challenging and computationally expensive. We show that small orchestrators managing other models and a variety of tools can both push the upper bound of intelligence and improve efficiency in solving difficult agentic tasks. We introduce ToolOrchestra, a method for training small orchestrators that coordinate intelligent tools. ToolOrchestra explicitly uses reinforcement learning with outcome-, efficiency-, and user-preference-aware rewards. Using ToolOrchestra, we produce Orchestrator, an 8B model that achieves higher accuracy at lower cost than previous tool-use agents while aligning with user preferences on which tools are to be used for a given query. On HLE, Orchestrator achieves a score of 37.1%, outperforming GPT-5 (35.1%) while being 2.5x more efficient. On tau2-Bench and FRAMES, Orchestrator surpasses GPT-5 by a wide margin while using only about 30% of the cost. Extensive analysis shows that Orchestrator achieves the best trade-off between performance and cost under multiple metrics, and generalizes robustly to unseen tools. These results demonstrate that composing diverse tools with a lightweight orchestration model is both more efficient and more effective than existing methods, paving the way for practical and scalable tool-augmented reasoning systems.

Comments:	21 pages, 6 figures
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
Cite as:	arXiv:2511.21689 [cs.CL]
	(or arXiv:2511.21689v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2511.21689

Submission history

From: Shizhe Diao
[v1] Wed, 26 Nov 2025 18:59:46 UTC (1,321 KB)
