Pull requests welcome. If you're adding a new scenario, include tests.
pip install -e ".[dev,inference]"
python -m pytest tests/ -v
ruff check . && ruff format .- Add the scenario config in
server/attack_engine.py(seeSCENARIOSdict) - Update
openenv.yamlwith the task definition - Add tests in
tests/test_scenarios.py - Update baseline scores in the README