A sandboxed Linux system-administration training environment for LLM agents. 35 tools · 15 graded tasks · adversarial traps · multi-signal reward shaping.
{"tool":"..","params":{}} per step. Supports any OpenAI-compatible model./etc/hosts ·
Task 6/8 may write /etc/hosts.deny ·
Task 6 may write /etc/passwd
| # | Task | Trap | Platform | Description |
|---|