I have a quick question regarding your APEX-Agents leaderboard setup using archipelago.
I noticed the use of:
- toolbelt_list_tools
- toolbelt_inspect_tool
- toolbelt_add_tool
- toolbelt_remove_tool
Could you share the reasoning behind this progressive tool exposure design?
From our perspective, revealing new tools by injecting updated tool definitions into the context may invalidate the KV cache (since tool schemas are placed at the beginning of the prompt), which could impact performance.
Was this trade-off intentional, or were alternative approaches considered?
For example, would it be feasible to only expose tool names initially, and return detailed specifications dynamically via tool call responses instead?
Thanks in advance for your insights!
I have a quick question regarding your APEX-Agents leaderboard setup using archipelago.
I noticed the use of:
Could you share the reasoning behind this progressive tool exposure design?
From our perspective, revealing new tools by injecting updated tool definitions into the context may invalidate the KV cache (since tool schemas are placed at the beginning of the prompt), which could impact performance.
Was this trade-off intentional, or were alternative approaches considered?
For example, would it be feasible to only expose tool names initially, and return detailed specifications dynamically via tool call responses instead?
Thanks in advance for your insights!