[AutoRL-Bench] Fix ALFWorld prompt exposure, update DeepSearchQA split, and translate tracked files to English by couragec · Pull Request #1367 · microsoft/RD-Agent

couragec · 2026-03-19T12:12:13Z

Summary

This PR updates AutoRL-Bench in three ways:

- Stop exposing react_prompts.json to agents in ALFWorld.
- Update DeepSearchQA to use the intended local split protocol.
- Translate tracked AutoRL-Bench files and documentation into English for open-source readiness.
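
For illustration only, here is a minimal sketch of one way to keep react_prompts.json out of a workspace that is later mounted for an agent. It is not the code in this PR: the function name `stage_workspace`, the `HIDDEN_FILES` tuple, and the copy-then-mount approach are all assumptions made for the example.

```python
import shutil
from pathlib import Path

# Hypothetical deny-list: file names that must never reach the agent's workspace.
HIDDEN_FILES = ("react_prompts.json",)


def stage_workspace(source_dir, workspace_dir):
    """Copy source_dir into workspace_dir, skipping every file in HIDDEN_FILES.

    Returns the sorted relative paths of the files that were staged.
    """
    shutil.copytree(
        source_dir,
        workspace_dir,
        # ignore_patterns is applied in every directory of the tree,
        # so nested copies of a hidden file are skipped as well.
        ignore=shutil.ignore_patterns(*HIDDEN_FILES),
        dirs_exist_ok=True,
    )
    root = Path(workspace_dir)
    return sorted(p.relative_to(root).as_posix() for p in root.rglob("*") if p.is_file())
```

Filtering at staging time, rather than deleting after the mount, means the file never exists at any path the agent can read.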

Changes

ALFWorld
- remove task-specific prompt exposure from the mounted workspace
DeepSearchQA
- switch to deterministic local split
- update the default protocol to 100 train / 200 eval
Documentation / open-source cleanup
- translate tracked files under rdagent/scenarios/rl/autorl_bench
- add bilingual README support
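
As a rough sketch of what a deterministic local split with the 100 train / 200 eval defaults could look like (not the implementation in this PR): each example ID is ranked by a salted SHA-256 hash, so the result does not depend on input order or on a random seed. The function name `deterministic_split`, the `salt` value, and the 900-item ID pool in the usage lines are hypothetical.

```python
import hashlib


def deterministic_split(example_ids, n_train=100, n_eval=200, salt="deepsearchqa-split"):
    """Return (train_ids, eval_ids); stable across runs and input orderings."""
    unique_ids = set(example_ids)
    if len(unique_ids) < n_train + n_eval:
        raise ValueError("not enough unique examples for the requested split")

    def rank(example_id):
        # Hash of salt + ID gives a fixed pseudo-random ordering key.
        digest = hashlib.sha256(f"{salt}:{example_id}".encode("utf-8")).hexdigest()
        return (digest, example_id)

    ordered = sorted(unique_ids, key=rank)
    return ordered[:n_train], ordered[n_train:n_train + n_eval]


# Usage with a made-up pool of IDs (the real dataset IDs are not shown in this PR).
ids = [f"q{i:04d}" for i in range(900)]
train_ids, eval_ids = deterministic_split(ids)
```

Changing `salt` produces a different but equally reproducible split, which is one way to version the protocol when the split sizes change.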

Notes

This branch is based on the latest upstream/main.
The PR keeps the existing fork-side AutoRL-Bench commits and merges in recent upstream updates.

📚 Documentation preview 📚: https://RDAgent--1367.org.readthedocs.build/en/1367/

couragec added 5 commits March 18, 2026 04:14

fix(alfworld): stop exposing react prompts to agents

81649dd

fix(deepsearchqa): use deterministic 100/800 split

fba5c2e

fix(deepsearchqa): switch default split to 100/200

d0f5b71

chore(autorl-bench): translate tracked files to english

014f031

Merge remote-tracking branch 'upstream/main' into pr/autorl-bench-upstream-sync

49d4df2

couragec closed this Mar 19, 2026

couragec wants to merge 5 commits into microsoft:main from couragec:pr/autorl-bench-upstream-sync
