What you're seeing
Attribution graphs on Qwen3.6-27B paper-grade SAEs. Upstream = L11, downstream = L31. Triangle nodes are SAE reconstruction-error terms (Marks et al. 2024). Edge thickness = |attribution|; orange = positive, cyan = negative. Each scenario uses a task-specific contrastive logit metric.