Prompt Optimization for Analog Circuit Placement
Prompt optimization reaches 97% of expert analog circuit placement quality — with zero training data.
Placement for analog circuit layout lacks the automated P&R flows available for digital design. Each transistor position affects matching, routing, and parasitics, so expert layout engineers spend hours hand-tuning layouts using intuition built over years. We investigate whether prompt optimization methods can teach LLMs to perform this task, starting from zero domain-specific training data. We recently released VizPy and have been evaluating our prompt optimization methods across several use cases. Analog circuit placement proved to be a particularly demanding benchmark due to the tight coupling between spatial reasoning, connectivity analysis, and multi-objective optimization.

Problem Formulation
Given a SPICE netlist (transistors, types, connectivity), produce (x, y) grid coordinates that minimize wiring and area penalties, scored as:

score = 1.0 − (0.8 × wire_penalty + 0.2 × area_penalty)

Higher scores are better; a penalty-free placement scores 1.0.
Wire penalty (80% weight) measures total Manhattan distance between connected device centers. Area penalty (20%) measures bounding box size. No overlaps permitted. Expert placements score 0.65–0.70.
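To make the objective concrete, here is a minimal scoring sketch in Python. Only the Manhattan wirelength, bounding-box area, and 0.8/0.2 weighting come from the benchmark definition; the function signature, the pairwise-within-net wiring model, and the normalization constants are assumptions for illustration.

```python
from itertools import combinations

def score_placement(placement, nets, w_wire=0.8, w_area=0.2):
    """Illustrative scorer: 1.0 minus weighted wirelength and area penalties.

    placement: {device: (x, y, w, h)} grid coordinates and sizes
    nets:      {net_name: [devices connected to that net]}
    The net model and normalization constants are placeholders, not the
    exact ones used in the benchmark.
    """
    def center(d):
        x, y, w, h = placement[d]
        return (x + w / 2.0, y + h / 2.0)

    # Wire penalty: total Manhattan distance between connected device centers.
    wirelength = 0.0
    for devices in nets.values():
        for a, b in combinations(devices, 2):
            (xa, ya), (xb, yb) = center(a), center(b)
            wirelength += abs(xa - xb) + abs(ya - yb)

    # Area penalty: bounding box enclosing all devices.
    xs = [x for x, _, w, _ in placement.values()] + [x + w for x, _, w, _ in placement.values()]
    ys = [y for _, y, _, h in placement.values()] + [y + h for _, y, _, h in placement.values()]
    area = (max(xs) - min(xs)) * (max(ys) - min(ys))

    # Map raw numbers into [0, 1] penalties (normalizers are placeholders).
    wire_penalty = min(1.0, wirelength / 100.0)
    area_penalty = min(1.0, area / 100.0)
    return 1.0 - (w_wire * wire_penalty + w_area * area_penalty)
```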
Our dataset consists of 9 differential amplifier variants (5 devices each), split 6/3 train/test. We note this is a small dataset — results should be interpreted with this constraint in mind. We also evaluate two unseen circuit families: CKTA (4-input NAND cascode, 8 devices) and CKTB (SR latch, 8 devices with cross-coupled feedback).
Baseline: Test-time RL Fine-tuning
Test-Time Training (TTT), which RL fine-tunes a 120B-parameter model with PUCT tree search, memorized the training circuit (0.670, matching the expert) but averaged 0.502 on test circuits, with a worst case of 0.385. The model learned specific coordinate patterns rather than transferable placement principles.

ContraPrompt
ContraPrompt mines good-vs-bad output pairs, extracts rules via a reflection LM, and validates each rule individually before injection (a minimal sketch of this loop follows the iteration list below). We augmented the standard contrastive pairs with expert (ground truth) placements on the "good" side: rather than comparing the LLM's best vs. worst attempts, we compared the LLM's worst attempt against expert placement descriptions (structural patterns without specific coordinates, to avoid rules that memorize exact positions). Starting from a 0.500 baseline, 3 iterations:

- Iteration 1: 5 candidate rules extracted, 1 survived validation (4 degraded performance). Score: 0.596.
- Iteration 2: 5 more candidates, 3 survived. Score: 0.624.
- Iteration 3: 3 candidates extracted but the resulting prompt scored 0.535 — the optimizer correctly rejected this iteration.
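The validate-before-inject step is what keeps the search safe: a candidate rule only enters the prompt if it improves the training-set score. A minimal sketch, assuming hypothetical helpers extract_rules (the reflection LM call) and evaluate_prompt (average score over the training circuits); this is not ContraPrompt's actual implementation.

```python
def contrastive_iteration(base_prompt, bad_output, expert_description,
                          extract_rules, evaluate_prompt):
    """One ContraPrompt-style iteration: propose rules from a good-vs-bad
    pair, keep only the rules that improve the training-set score."""
    baseline = evaluate_prompt(base_prompt)

    # Reflection LM proposes candidate rules from the contrastive pair.
    candidates = extract_rules(good=expert_description, bad=bad_output)

    accepted, current_prompt, current_score = [], base_prompt, baseline
    for rule in candidates:
        trial_prompt = current_prompt + "\n" + rule
        trial_score = evaluate_prompt(trial_prompt)
        # Validate each rule individually: inject only if it helps.
        if trial_score > current_score:
            accepted.append(rule)
            current_prompt, current_score = trial_prompt, trial_score

    return current_prompt, current_score, accepted
```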
The two surviving rules teach column-based layout with drain-pair alignment, discovered entirely from optimization feedback:
Rule 1: Prioritize column-based vertical stacking and structural relationships between devices sharing critical signal nets (drain-paired PMOS and NMOS on the same output nodes).
Rule 2: Separate PMOS and NMOS into distinct columns, enumerate drain-paired devices, and output a structured placement plan rather than stopping at topology understanding.
We expected the rules to encode specific numeric parameters (x-offsets, y-offsets). Instead, they encode high-level strategy — the LLM determines specific values at generation time based on each circuit's device dimensions.
Post-Optimization Code Generation
After prompt optimization with Sonnet, we generated placement code using Opus with the optimized prompt. The LLM produces a Python function that algorithmically computes placements: not raw coordinates but executable code evaluated across all training circuits. We selected the best code across 6 training-circuit prompts based on cross-circuit evaluation (each candidate was scored on all 6 training circuits to enforce generality). Train average: 0.636 (expert 0.665, 96%). Test average: 0.634 (expert 0.652, 97%). On test_amp8, the generated code exceeds the expert (0.640 vs. 0.626). On test_amp0, it matches exactly (0.670). No circuit scores below 0.604.

A note on model capability: Opus produces similar code quality from the prompt alone. The ContraPrompt rules close a 24% gap for Sonnet but are largely redundant for Opus, which independently converges to the same column-based strategy. Code generation is a constrained output space; the function contract and helper API narrow the search enough that a frontier model finds the right approach without additional prompt-level guidance. With only 6 training circuits, the optimization signal is also thin, leaving limited room for learned rules to exceed what a strong model discovers on its own.
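For intuition, here is the general shape of code the model tends to produce under the column-based rules. This is an illustrative sketch, not the generated function from our runs; the device-record format and grid units are assumptions, and drain-pair row alignment (sorting each column so devices on shared output nets land in the same row) is omitted for brevity.

```python
def place_devices(devices):
    """Column-based placement sketch: NMOS devices stack in the left column,
    PMOS devices in the right, each column filled bottom-up.

    devices: list of dicts like {"name": "M1", "type": "NMOS", "w": 4, "h": 2}
    returns: {name: (x, y)} lower-left grid coordinates, no overlaps
    """
    nmos = [d for d in devices if d["type"] == "NMOS"]
    pmos = [d for d in devices if d["type"] == "PMOS"]

    placement = {}
    # Left column starts at x = 0; right column starts past the widest NMOS
    # plus a one-unit routing gap (the gap width is a placeholder choice).
    left_x = 0
    right_x = max((d["w"] for d in nmos), default=0) + 1

    for column, x in ((nmos, left_x), (pmos, right_x)):
        y = 0
        for d in column:
            placement[d["name"]] = (x, y)
            y += d["h"]  # stack vertically with no overlap

    return placement
```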
Figure 1: Average placement scores across 9 circuits. The gap between ContraPrompt+Opus (0.634) and Expert (0.652) is 0.018 — smaller than the gap between any two adjacent methods in prior work. TTT at 120B parameters scores 0.502, 26% below the prompt-optimized approach.
Figure 2: Per-circuit scores. The consistent gap between ContraPrompt+Opus and Expert across all 9 circuits (no circuit below 0.604) suggests the learned strategy generalizes rather than overfitting to specific topologies. The "No optimization" baseline shows high variance (0.371–0.583), indicating that the minimal prompt alone produces inconsistent strategies.
PromptGrad
PromptGrad evolves the prompt using textual gradients, at each step asking "what instruction change would have improved this placement?" We ran it on the same minimal prompt with abstract feedback (qualitative levels rather than specific scores, to prevent rules from hardcoding circuit-specific numbers). PromptGrad proposed rules across 3 epochs, but none survived validation on our 6-circuit training set. We attribute this to the small dataset size: with only 6 evaluation circuits, the signal-to-noise ratio for gradient-based rule proposals is low, and rules that help on one circuit often hurt on another. On prior benchmarks with larger evaluation sets, PromptGrad converges reliably (see our CKTB results below, where it finds the optimal in 2 attempts with feedback injection). This is a known limitation of gradient-based prompt optimization on small datasets; contrastive methods like ContraPrompt appear more sample-efficient because they extract rules from paired comparisons rather than aggregate gradients.
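A textual-gradient step looks roughly like the sketch below. The helpers critique_lm, rewrite_lm, and evaluate_prompt are hypothetical stand-ins for the reflection and validation calls; this is a simplified view of the loop, not PromptGrad's exact interface.

```python
def textual_gradient_step(prompt, placement_output, feedback,
                          critique_lm, rewrite_lm, evaluate_prompt):
    """One PromptGrad-style step: turn qualitative feedback on a placement
    into an instruction edit, and keep the edit only if validation improves."""
    # "Gradient": a natural-language critique of what instruction change
    # would have improved this placement.
    gradient = critique_lm(
        f"Prompt:\n{prompt}\n\nPlacement:\n{placement_output}\n\n"
        f"Feedback: {feedback}\n"
        "What instruction change would have improved this placement?"
    )

    # Apply the gradient: rewrite the prompt in the suggested direction.
    candidate = rewrite_lm(
        f"Revise the prompt below according to this suggestion.\n"
        f"Suggestion: {gradient}\n\nPrompt:\n{prompt}"
    )

    # Validate on the training circuits before accepting the edit.
    if evaluate_prompt(candidate) > evaluate_prompt(prompt):
        return candidate
    return prompt
```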
GEPA

GEPA (evolutionary prompt adaptation with a stronger meta-optimizer) scored 0.556 on the diff-amp benchmark, below ContraPrompt (0.624). During training, GEPA's evolutionary search developed a horizontal spreading bias visible in its reasoning traces: the model consistently pushes devices rightward even when vertical stacking would minimize wire length. On CKTB (below), GEPA converges to 0.595 while ContraPrompt and PromptGrad both reach 0.620; the horizontal bias becomes a local minimum that the retry loop cannot escape. One explanation is that evolutionary search, by selecting across a population of prompt variants, can amplify layout priors that happen to correlate with score on the training distribution but don't transfer. Contrastive and gradient methods, which reason about why a specific placement scored poorly, appear less susceptible to this failure mode.
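To see how purely score-driven selection can lock in such a bias, here is a minimal evolutionary-prompt-search sketch. The mutate_prompt and evaluate_prompt helpers are hypothetical stand-ins, and this illustrates evolutionary prompt adaptation in general rather than GEPA's actual algorithm.

```python
import random

def evolve_prompts(seed_prompt, mutate_prompt, evaluate_prompt,
                   population_size=8, generations=5, survivors=3):
    """Minimal evolutionary prompt search: mutate, score, keep the fittest.

    Selection is purely score-driven, so any prompt trait that correlates
    with training-set score (e.g. a horizontal-spreading instruction) gets
    amplified generation after generation, whether or not it transfers.
    """
    population = [seed_prompt] + [mutate_prompt(seed_prompt)
                                  for _ in range(population_size - 1)]
    for _ in range(generations):
        ranked = sorted(population, key=evaluate_prompt, reverse=True)
        parents = ranked[:survivors]
        children = [mutate_prompt(random.choice(parents))
                    for _ in range(population_size - len(parents))]
        population = parents + children
    return max(population, key=evaluate_prompt)
```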
Unseen Circuits

We evaluated all three optimizers on CKTA and CKTB with 10 attempts each and structured feedback injection between attempts.
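The attempt loop is straightforward: keep the best placement seen so far and feed a critique of the previous attempt into the next request. A minimal sketch, with generate_placement, score_placement, and describe_failures as hypothetical helpers; the exact feedback format we used differs from this illustration.

```python
def retry_with_feedback(prompt, circuit, generate_placement, score_placement,
                        describe_failures, attempts=10):
    """Keep the best placement over N attempts, injecting structured
    feedback about the previous attempt each round."""
    best_score, best_placement, feedback = float("-inf"), None, ""
    for _ in range(attempts):
        placement = generate_placement(prompt + feedback, circuit)
        score = score_placement(placement, circuit)
        if score > best_score:
            best_score, best_placement = score, placement
        # Structured feedback for the next attempt.
        feedback = ("\nPrevious attempt had these issues: "
                    + describe_failures(placement, circuit))
    return best_placement, best_score
```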
Figure 3: Cumulative best score across attempts. On CKTA, ContraPrompt converges at attempt 1; on CKTB, PromptGrad converges at attempt 2. GEPA plateaus at 0.595 on CKTB — 0.025 below optimal — due to its horizontal layout bias.
On CKTA, ContraPrompt reaches the optimal (0.628) on its first attempt. All three methods eventually converge to the same placement — identical coordinates for all 8 devices — but ContraPrompt requires 1 attempt vs. 4 for PromptGrad and GEPA.
On CKTB, PromptGrad reaches 0.620 at attempt 2, while ContraPrompt requires 7 attempts. CKTB's cross-coupled feedback creates multiple valid topologies; PromptGrad's learned search strategy navigates this space more efficiently than ContraPrompt's knowledge-driven approach.
We observe a consistent pattern: ContraPrompt converges faster on circuits with clear hierarchical structure (where domain knowledge directly constrains the search), while PromptGrad converges faster on circuits with feedback loops (where learned search heuristics outperform first-principles reasoning). Whether this pattern holds on a larger circuit set is an open question.
Placement Examples
Figure 4: test_amp0 — Expert (0.670) vs. ContraPrompt+Opus (0.670). Both use a two-column layout with NMOS differential pair and tail current source in the left column, PMOS loads in the right.
Figure 5: test_amp2 — Expert (0.692) vs. ContraPrompt+Opus (0.639). The LLM preserves differential pair symmetry and type separation but uses slightly wider column spacing, accounting for the 0.053 gap.
Figure 6: test_amp3 — Expert (0.700) vs. ContraPrompt+Opus (0.662). Device grouping is correct; the score gap comes from suboptimal vertical alignment of the drain-paired devices.
Summary
On 9 differential amplifier circuits, code generation with Opus reaches 97% of expert placement quality (test avg 0.634 vs. expert 0.652) with no training data or circuit-specific fine-tuning. The approach outperforms RL fine-tuning of a 120B model by 26% (0.634 vs. 0.502). Three observations from this work:

- ContraPrompt discovers domain strategy from scratch. Starting from a minimal prompt with no placement hints, ContraPrompt extracted column-based drain-pair alignment rules that improved Sonnet's scores by 24% (0.500 → 0.624). The rules encode high-level strategy rather than numeric parameters; the model fills in circuit-specific values at generation time. When paired with Opus for final code generation, the pipeline reaches 97% of expert quality.
- ContraPrompt and PromptGrad exhibit complementary convergence profiles. ContraPrompt converges faster on hierarchical circuits (1 attempt on CKTA) while PromptGrad converges faster on circuits with feedback loops (2 attempts on CKTB). PromptGrad's gradient-based approach struggled on our small 6-circuit training set, suggesting it requires larger evaluation sets to reliably propose rules. Whether an ensemble of both methods can capture the strengths of each is an open question.
- The 3% gap to expert quality likely requires finer-grained spatial control (per-device offsets, orientation-aware routing estimation, or multi-objective parameter search) rather than further prompt optimization.

Want to try prompt optimization on your own use case? Try VizPy free or reach out to us.