Our Optimizer Matched or Beat a Production-Grade Blackbox Optimizer on 96% of Benchmarks
We hand an LLM a 9-line random-search stub. After five rounds of contrastive feedback, it produces a specialized solver that matches or outperforms Optuna on 96% of the 55 EvalSet benchmarks, with no human-written solver code.
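To give a feel for the starting point, here is a minimal sketch of what a random-search stub of roughly that size could look like. It is not the actual stub used in the experiments: the function name `random_search`, its signature, the box-bounds representation, and the choice to minimize are all assumptions made for illustration.

```python
import random

def random_search(objective, bounds, budget, seed=0):
    """Hypothetical stub: minimize `objective` by uniform sampling in a box."""
    rng = random.Random(seed)                  # seeded for reproducible runs
    best_x, best_y = None, float("inf")
    for _ in range(budget):
        # Draw one candidate uniformly inside each (low, high) interval.
        x = [rng.uniform(lo, hi) for lo, hi in bounds]
        y = objective(x)
        if y < best_y:                         # keep the best point seen so far
            best_x, best_y = x, y
    return best_x, best_y
```

A stub like this uses nothing it has learned from earlier evaluations, which is the gap a specialized solver has room to close.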