Summary of all token-cost optimizations shipped, with the evidence that motivated each and the measured or projected impact. Intended for task reporting (RSM / Linear).
Pilot 11 (first instrumented run): ~$102.90 total. Manager ran on Opus for the entire session, including mechanical phases (file IO, jq calls, idle wave-wait). Token capture was manual and inaccurate — Tester model was assumed to be Sonnet but transcript analysis later showed ~50% of Tester calls went to Opus.
Pilot 17 (latest comparable): ~$19.50 total. ~81% reduction. Sonnet Manager + Sonnet Planner + Haiku Testers. 9/10 recall (10/10 is the amended-harness baseline).
Problem: The Manager ran on Opus for the entire session. Pilot 11 token analysis showed ~66% of the $70.81 Manager bill came from ~11 minutes of high-cognition work in Phase 1.5 (static analysis) and Phase 3 (charter generation). The other 34% was mechanical: file IO, prompt assembly, jq merging, and idle wave-wait — none of which needs Opus.
Fix: Created a dedicated planner-opus subagent pinned to model: opus. It fires twice per pilot — once for static analysis (~3 min), once for charter generation (~4 min). The Manager itself defaults to Sonnet 4.6 for everything else.
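The pin itself is a few lines of agent frontmatter. A minimal sketch (the description text here is illustrative, not the shipped file):

```markdown
---
name: planner-opus
description: High-cognition planning only (Phase 1.5 static analysis, Phase 3 charter generation)
model: opus
---
```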
Impact: ~30–40% Manager cost reduction per pilot with no recall regression (validated across Pilots 12–15 on magellan-backups). The isolation also eliminates Opus-rate cache creation for the entire Manager conversation — the planner gets a fresh context on each dispatch and pays only for what it reads.
Problem: Transcript analysis (via the token instrumentation shipped in 5030a83) revealed that the prior assumption "Testers run on Sonnet" was wrong. Claude Code routes subagents dynamically; in one measured session, 79 subagents split ~50% Opus / 40% Sonnet / 2% Haiku.
Fix: Added model: sonnet to .claude/agents/tester.md frontmatter. Claude Code's Agent tool honors this as the default; per-charter model: overrides remain available.
Impact: Sonnet rates are ~40% of Opus across all token categories (input, output, 5m cache-create, 1h cache-create, cache-read). A 6-Tester wave at Opus rates costs ~$32–40; at Sonnet rates ~$13–16. Validated across three plugin shapes with no recall regression.
Problem: Testers returned full prose summaries of findings — PQIP tables, anchor verdict lists, evidence excerpts — directly into the Manager's conversation context. Pilot 14 traced a 3× Manager cost spike to this: Manager cc1h (1-hour cache creation) jumped from $0.58 to $10.43 between Pilots 13 and 14 despite the same number of Testers. Each Tester summary block was absorbed into the Manager context and re-cached at 1-hour TTL.
Fix: Testers now return exactly one line:

```
status=<status> report=<absolute-path-to-report.json>
```
The report.json is the source of truth. aggregate-reports.mjs reads all reports at Phase 5 and produces final-report.md. The Manager reads the aggregated output — not per-Tester returns.
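A sketch of the aggregation shape in Node, assuming a `runs/<id>/reports/` layout and `charter`/`status`/`findings` fields in `report.json` (all illustrative; the shipped script's schema may differ):

```js
// aggregate-reports.mjs (sketch): merge per-Tester report.json files into one
// final-report.md so the Manager reads a single artifact, not N prose returns.
import { readdirSync, readFileSync, writeFileSync } from "node:fs";
import { join } from "node:path";

const runDir = process.argv[2];             // e.g. runs/<id>
const reportsDir = join(runDir, "reports"); // assumed layout

const reports = readdirSync(reportsDir)
  .filter((f) => f.endsWith(".json"))
  .map((f) => JSON.parse(readFileSync(join(reportsDir, f), "utf8")));

const out = ["# Final report", ""];
for (const r of reports) {
  out.push(`## ${r.charter}: ${r.status}`);             // assumed field names
  for (const f of r.findings ?? []) out.push(`- ${f.summary}`);
  out.push("");
}
writeFileSync(join(runDir, "final-report.md"), out.join("\n"));
```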
Impact: Manager cc1h projected back to ~$1.50 (from $10.43) for an 8-Tester run. The fix also removes the semantic risk of summaries drifting from the actual report content.
Problem: Tester dispatch prompts were ~60 lines of inline guidance — driver-specific notes ("write a Playwright spec, not MCP calls"), schema enum reminders, validation step reminders, evidence minimums. All of this was already in .claude/agents/tester.md and the driver skill files, so it was repeated in every dispatch, inflating the Manager's cache.
Fix: Dispatch prompts reduced to ~12 lines: the Tester's role spec and driver skill cover everything else. An explicit anti-pattern table documents what NOT to add to dispatch prompts and why.
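For illustration only, a dispatch prompt in the reduced shape; every detail below is hypothetical, and the real template ships with the Manager spec:

```text
You are a Tester. Charter: /abs/path/runs/<id>/charters/03-breadth.md
Driver skill: playwright. Site: <provisioned-site-url>
Execute the charter, write report.json to the path it names, then return exactly:
status=<status> report=<absolute-path-to-report.json>
```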
Impact: ~50 lines × 7 charters = ~350 lines of Manager output saved per run. At Sonnet Manager rates the absolute dollar impact is small, but the discipline prevents the pattern from re-inflating as new instructions get added over time.
Problem: Pilot 12 spent ~30 of ~105 Manager turns on work that didn't need an LLM: iterating over charter slugs to read site metadata, looping through teardown calls, writing 8 separate markdown charter files. Each turn paid Sonnet Manager rates plus accumulated cache-write overhead.
Fix: Three scripts moved this work out of the LLM:
- `provision-charters.sh` — bulk parallel site provisioning
- `teardown-all-sites.sh` — bulk teardown in one call (replaces N sequential site-delete calls)
- `generate-charter-files.mjs` — renders `coverage.md` + per-charter markdown from a single `charter-set.json` (planner writes one JSON file instead of 8 markdown files); sketched below
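A sketch of the renderer, assuming `charter-set.json` carries a `charters` array with `slug`/`title`/`body` fields (names illustrative):

```js
// generate-charter-files.mjs (sketch): expand one charter-set.json into
// coverage.md plus per-charter markdown, replacing 8 LLM file-write turns.
import { readFileSync, writeFileSync, mkdirSync } from "node:fs";
import { join } from "node:path";

const setPath = process.argv[2];              // e.g. runs/<id>/charter-set.json
const outDir = process.argv[3] ?? "charters";
const { charters } = JSON.parse(readFileSync(setPath, "utf8"));
mkdirSync(outDir, { recursive: true });

const coverage = ["# Coverage", ""];
for (const c of charters) {
  // slug/title/body are assumed field names in the planner's JSON schema
  writeFileSync(join(outDir, `${c.slug}.md`), `# ${c.title}\n\n${c.body}\n`);
  coverage.push(`- ${c.slug}: ${c.title}`);
}
writeFileSync(join(outDir, "coverage.md"), coverage.join("\n") + "\n");
```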
Impact: ~30 Manager turns saved per run. Planner Opus turn count reduced from ~8 file-writes to 1. Manager no longer reads individual charter files during dispatch — Testers read their own charters.
Problem: Early token capture used the Agent tool's total_tokens surface only — no input/output split, no cache-tier breakdown, no per-model attribution. This produced the incorrect "Sonnet Testers" assumption described in §2.
Fix: scripts/capture-run-tokens.mjs parses Claude Code's local transcript files (~/.claude/projects/<proj-hash>/*.jsonl), which contain per-message usage blocks with full breakdown: input_tokens, output_tokens, cache_creation_input_tokens (5m + 1h tiers), cache_read_input_tokens, and model name. Writes runs/<id>/token-usage.json with per-LLM × per-tier × per-subagent detail plus cost estimate.
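The core of the capture is a fold over per-message usage blocks. A sketch, with field paths as observed in local transcripts (treat them as assumptions, not a stable API):

```js
// capture-run-tokens.mjs (sketch): fold per-message usage from Claude Code's
// transcript JSONL into per-model totals across all cache tiers.
import { readFileSync } from "node:fs";

const totals = {}; // model -> { input, output, cc5m, cc1h, cacheRead }

for (const file of process.argv.slice(2)) { // pass the *.jsonl paths
  for (const line of readFileSync(file, "utf8").split("\n")) {
    if (!line.trim()) continue;
    const msg = JSON.parse(line);
    const u = msg.message?.usage;           // usage block location as observed
    if (!u) continue;
    const model = msg.message.model ?? "unknown";
    const t = (totals[model] ??= { input: 0, output: 0, cc5m: 0, cc1h: 0, cacheRead: 0 });
    t.input     += u.input_tokens ?? 0;
    t.output    += u.output_tokens ?? 0;
    t.cc5m      += u.cache_creation?.ephemeral_5m_input_tokens ?? 0;
    t.cc1h      += u.cache_creation?.ephemeral_1h_input_tokens ?? 0;
    t.cacheRead += u.cache_read_input_tokens ?? 0;
  }
}
console.log(JSON.stringify(totals, null, 2));
```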
Why this matters: Without accurate instrumentation, optimizations can't be measured and claims about where the cost goes are unreliable. The Opus→Sonnet Tester switch (§2) was discovered via this tool, not by assumption.
Problem: Planner was pinned to Opus. Some pilots (cost-bounded experiments, rapid-draft passes) don't need Opus-level recall precision.
Fix: planner_model: field in MISSION.md (or MAGELLAN_PLANNER_MODEL env var) selects haiku | sonnet | opus. Three variants initially shipped as separate files; later consolidated to one planner.md with model selected at dispatch.
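Assuming the field sits in MISSION.md frontmatter (placement illustrative):

```markdown
---
planner_model: sonnet   # haiku | sonnet | opus; MAGELLAN_PLANNER_MODEL env var also accepted
---
```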
| Variant | Model | Validated recall |
|---|---|---|
| `planner-opus` | Opus 4.7 | 10/10 (regression-test plugin) |
| `planner-sonnet` | Sonnet 4.6 | 9/10 (Pilot 17, magellan-backups) |
| `planner-haiku` | Haiku 4.5 | Not validated — known recall regression risk |
Impact: Pilot 17 (Sonnet planner + Haiku Testers + Sonnet Manager) ran at ~$19.50 vs Pilot 11's $102.90 — ~81% reduction — while holding 9/10 recall.
Problem: After the Sonnet default (§2), Testers still ran at Sonnet rates even on low-complexity breadth charters that don't need Sonnet-level reasoning; Haiku rates are ~10% of Opus (Sonnet is ~40%).
Fix: tester_model: haiku in MISSION.md. Charter-level model: override available for mixed waves (Haiku for low-complexity breadth charters, Sonnet for depth/AND-list charters).
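A mixed-wave sketch, assuming the override lives in charter frontmatter (placement and filename hypothetical):

```markdown
---
# 05-backups-and-list-depth.md: keep this depth charter on Sonnet
model: sonnet   # wave default is tester_model: haiku in MISSION.md
---
```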
Impact: Pilot 17 validated Haiku Testers at 9/10 recall on magellan-backups. The one miss (Issue 9 — scale-sensitive query) was a known chronic miss class addressed by Reinforcement 3 (shipped in ef3205b).
Problem: Five agent variant files drifted after forking: tester-opus.md, tester-haiku.md, planner-sonnet.md, planner-haiku.md, and the original tester.md/planner-opus.md. Drift caused quality regressions: Haiku/Opus Tester variants were missing env_warnings, lens loading, and env-blocker triage. Duplication also meant Phase 1.5 and Phase 3 specs existed in both test-plugin.md and planner-opus.md, diverging over time.
Fix (8 sub-commits, squashed):
- Delete 4 variant files (3 Tester, 1 Planner): `tester-opus.md`, `tester-haiku.md`, `planner-sonnet.md`, `planner-haiku.md` — 1,930 lines deleted
- Move Phase 3 charter-authoring rules from `test-plugin.md` into `planner.md` (single authoritative source)
- Move Phase 1.5 static-analysis spec from `test-plugin.md` into `planner.md`
- Split operator reference content out of `AGENTS.md` → `docs/operating-guide.md`
- Tighten browser-driver skills: shared dialog/upload/snapshot rules across all three driver skill files
File-level deltas:
- `test-plugin.md`: 1,475 → 1,124 lines (−23.8%)
- `AGENTS.md`: 366 → 216 lines (−41%)
- Net: ~2,160 lines removed across 24 files
Measured cost impact: total cost moved +1.8% (within run-to-run variance). The audit's original "save N lines = save tokens" framing was wrong on magnitude — cached prompt size is a small cost lever. Cache reads account for ~50% of the total bill and are not affected by prompt size. The drivers that dominate cost are: model selection, turn count, wave size, and context inflation (addressed by §3).
The consolidation's primary value is correctness (drift eliminated) and maintainability (one place to edit), not direct token savings.
From the audit data across Pilots 11–18:
| Lever | Impact | Notes |
|---|---|---|
| Model selection (Opus→Sonnet→Haiku) | High | Sonnet is 40% of Opus; Haiku is ~10% of Opus across all cache tiers |
| Context inflation prevention | High | Tester prose returns inflated Manager cc1h 18× in one pilot |
| Turn count reduction | Medium | ~30 turns saved by script-driven phases; each turn has output + cache-write cost |
| Prompt size reduction | Low | Cache reads are billed per token regardless; reducing prompt size saves only uncached input reads |
| Parallel wave dispatch | No direct token impact | Reduces wallclock, not token count |
The largest single optimization was the Sonnet Tester default (§2) combined with planner subagent isolation (§1). Together they changed the model mix from ~50% Opus to ~10% Opus (planner only), which explains most of the $102.90 → $19.50 reduction.