Question Regarding Tool Exposure Strategy in APEX-Agents Leaderboard Evaluation

I have a quick question regarding your APEX-Agents leaderboard setup using archipelago.

I noticed the use of:

* toolbelt_list_tools
* toolbelt_inspect_tool
* toolbelt_add_tool
* toolbelt_remove_tool

Could you share the reasoning behind this progressive tool exposure design?

From our perspective, revealing new tools by injecting updated tool definitions into the context may invalidate the KV cache (since tool schemas are placed at the beginning of the prompt), which could impact performance.

Was this trade-off intentional, or were alternative approaches considered?

For example, would it be feasible to only expose tool names initially, and return detailed specifications dynamically via tool call responses instead?

Thanks in advance for your insights!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question Regarding Tool Exposure Strategy in APEX-Agents Leaderboard Evaluation #25

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Question Regarding Tool Exposure Strategy in APEX-Agents Leaderboard Evaluation #25

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions