| Agent | Model | Reward | Tools | Classification | |
|---|---|---|---|---|---|
| codex | gpt-5.5 | ✓ resolved | 20 | BAD_SUCCESS | view → |
| codex | gpt-5.5 | ✓ resolved | 13 | BAD_SUCCESS | view → |
| codex | gpt-5.5 | ✓ resolved | 25 | BAD_SUCCESS | view → |
The task as the agent saw it and the verifier graded it. Files under solution/ are sealed.