Back to Observatory
LIVE · Observatory · Q1 2026

Circuit Canvas

Figma-style attribution graphs for SAE features. Nodes = features. Edges = feature-to-feature attribution, scored by AtP* (Kramár et al. 2024). Pick a scenario below — each is a real circuit computed on our Qwen3.6-27B paper-grade SAEs.

PREVIEW · ATP · Qwen/Qwen3.6-27B · τ_node=0.05 · τ_edge=0.01 · 1 prompts
14 nodes3 edges
Mini Map
metric · logit['heart'] − logit['stomach']prompt · Patient with sudden chest pain. The most likely cause involves the

What you're seeing

Attribution graphs on Qwen3.6-27B paper-grade SAEs. Upstream = L11, downstream = L31. Triangle nodes are SAE reconstruction-error terms (Marks et al. 2024). Edge thickness = |attribution|; orange = positive, cyan = negative. Each scenario uses a task-specific contrastive logit metric.

How to build your own

Run 15b_sfc_qwen36_27b_papergrade.ipynb for the four-scenario SFC pipeline, or 14_attribution_patching.ipynb for node-only AtP*. Both emit circuit.json in the schema this viewer consumes.