Each card uses downsampled Q, K, and V head partitions from the selected attention layer.
Per-head Q, K, and V slices stay read-only and downsampled for browser-safe inspection.
Provide a short sequence, select the query token index, and compute a compact read-only activation path for the current head.
Shows the selected token's query vector plus compact key and value examples for the current sequence.
Displays the softmax attention profile across the sequence and the most strongly attended tokens.
Summarizes the selected token, the top attended targets, and the resulting head output vector.
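The activation path described above (query vector, softmax profile, top attended tokens, head output) can be sketched for a single head as follows. This is a minimal illustration that assumes standard scaled dot-product attention with no masking. The function name `attend`, the `top_k` parameter, and the toy vectors are hypothetical and are not taken from the tool itself, which may also downsample or mask differently.

```python
import math

def attend(query, keys, values, top_k=2):
    """Scaled dot-product attention for one query token in one head (illustrative)."""
    d = len(query)
    # Raw scores: dot product of the query with every key, scaled by sqrt(d).
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d) for key in keys]
    # Numerically stable softmax over the sequence positions.
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    # Head output: attention-weighted sum of the value vectors.
    width = len(values[0])
    output = [sum(w * v[j] for w, v in zip(weights, values)) for j in range(width)]
    # Indices of the most strongly attended tokens, strongest first.
    top = sorted(range(len(weights)), key=lambda i: weights[i], reverse=True)[:top_k]
    return weights, top, output

# Toy three-token sequence with two-dimensional (already downsampled) vectors.
keys = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
values = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
query = [2.0, 0.0]

weights, top, output = attend(query, keys, values)
```

Here `weights` corresponds to the softmax profile, `top` to the strongest attended tokens, and `output` to the resulting head output vector for the selected token.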
Compare head similarity, clusters, redundancies, specialization, and anomalies for the current layer.
Cosine similarity across lightweight per-head Q/K/V feature vectors for this layer.
PCA projection of head features. Nearby points indicate similar query/key/value behavior.
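The head-to-head cosine similarity mentioned above can be sketched as a small pairwise matrix. This is only an illustration: how the tool actually builds its per-head Q/K/V feature vectors is not stated here, so `head_features` below is a hypothetical stand-in with made-up numbers, and the helper names are assumptions.

```python
import math

def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)

def similarity_matrix(features):
    """Pairwise cosine similarity between every pair of head feature vectors."""
    return [[cosine(a, b) for b in features] for a in features]

# Hypothetical feature vectors for three heads in one layer.
head_features = [
    [1.0, 0.0, 2.0],  # head 0
    [2.0, 0.0, 4.0],  # head 1: same direction as head 0, so likely redundant
    [0.0, 3.0, 0.0],  # head 2: orthogonal to heads 0 and 1
]

sim = similarity_matrix(head_features)
```

Entries near 1 suggest redundant heads, while entries near 0 suggest heads that respond to unrelated directions in feature space.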
Track how one selected head contributes to the residual stream and which heads respond most strongly in the next layers.
Use a short sequence and selected token to compute a compact, read-only cross-layer flow from the current head.
Shows the selected source head and the strongest responding heads across the configured depth.
Each row represents one hop and each column shows the influence score for a target head in the next layer.
Inspect concept-like neurons, sparse detectors, sampled token correlations, and neuron directions for the selected feed-forward layer.
Sort and filter neurons by activation, sparsity, and concept-like scores.
Choose a neuron to inspect current-input activation, token correlations, and direction alignment.
This page turns the trained checkpoint into a browser-safe 3D brain atlas. The scene uses real learned tensors, samples the strongest learned rows from each layer, and places them inside an outlined brain shell so you can orbit, inspect, and travel across the network.
Press Start Atlas to begin building the neural journey from the current trained checkpoint.