Skip to content

Question Regarding Tool Exposure Strategy in APEX-Agents Leaderboard Evaluation #25

@TheRoadQaQ

Description

@TheRoadQaQ

I have a quick question regarding your APEX-Agents leaderboard setup using archipelago.

I noticed the use of:

  • toolbelt_list_tools
  • toolbelt_inspect_tool
  • toolbelt_add_tool
  • toolbelt_remove_tool

Could you share the reasoning behind this progressive tool exposure design?

From our perspective, revealing new tools by injecting updated tool definitions into the context may invalidate the KV cache (since tool schemas are placed at the beginning of the prompt), which could impact performance.

Was this trade-off intentional, or were alternative approaches considered?

For example, would it be feasible to only expose tool names initially, and return detailed specifications dynamically via tool call responses instead?

Thanks in advance for your insights!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions