You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Jul 16, 2025. It is now read-only.
When processing large images or documents (file) with every request the files is send as encoded data. This increases the token usage since every time the file needs to be ingested again. Especially with Tool Calling this leads to a high usage of tokens.
To mitigate this issues some platforms provide a way to cache input files. It would be nice to fine a way to enable caching.