| Agent | Env Reward | Shaped Reward | Δ vs Baseline | Skills |
|---|---|---|---|---|
| Random Agent | +0.039 | N/A | N/A | 0 |
| Basic LLM (pre-train) | +0.021 | +4.69 | ref | 0 |
| Voyager-lite (pre-train) | +0.024 | +5.71 | +1.02 (+22%) | 2 |
| Basic LLM (post-train) | +0.023 | +4.56 | -0.13 | 0 |
| ⭐ Voyager-lite (post-train) | +0.025 | +5.53 | +0.84 (+18%) | 2 |
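
The Δ column appears to be the difference in shaped reward relative to the row marked `ref` (Basic LLM, pre-train), with the percentage taken against that same baseline. A minimal sketch of that arithmetic, assuming this reading is correct; the dictionary and the function name `delta_vs_baseline` are illustrative and not part of the original evaluation code:

```python
# Shaped rewards copied from the table above.
# Assumption: the "ref" row (Basic LLM, pre-train) is the baseline.
shaped = {
    "Basic LLM (pre-train)": 4.69,
    "Voyager-lite (pre-train)": 5.71,
    "Basic LLM (post-train)": 4.56,
    "Voyager-lite (post-train)": 5.53,
}
BASELINE = "Basic LLM (pre-train)"


def delta_vs_baseline(rewards, baseline):
    """Return {agent: (absolute delta, percent delta)} relative to the baseline."""
    base = rewards[baseline]
    result = {}
    for agent, reward in rewards.items():
        diff = reward - base
        # Round to the precision used in the table: 2 decimals, whole percent.
        result[agent] = (round(diff, 2), round(100 * diff / base))
    return result


deltas = delta_vs_baseline(shaped, BASELINE)
for agent, (diff, pct) in deltas.items():
    print(f"{agent}: {diff:+.2f} ({pct:+d}%)")
```

Under that assumption the computed values match the table: +1.02 (+22%) and +0.84 (+18%) for the two Voyager-lite rows, and -0.13 for Basic LLM (post-train).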