🧅 SituatiONION : Transformer Representation Analysis Pipeline

Experimental pipeline for extracting and evaluating hidden state representations from transformer language models to quantify how semantic information emerges across layers.

Overview

Transformer models encode increasingly abstract semantic structure as information propagates through layers. This project builds a reproducible pipeline to:

Extract hidden state representations from each transformer layer
Construct feature datasets from embeddings
Quantitatively evaluate representation quality using linear probes
Visualize representation geometry using dimensionality reduction

Core questions:

Where in the network is semantic information most separable?
How does representation structure evolve across depth?
Which layers produce the most useful downstream features?

Key Results

Figure 1. Linear probe accuracy as a function of transformer layer depth. Separability increases from early layers, peaks in mid-layers, and declines or stabilizes in later layers. This pattern indicates that mid-layers contain the most linearly separable and semantically structured representations, consistent with the SituatiONION hypothesis.

Figure 2. PCA projection of token representations across transformer mid-layers in a shared embedding space. Each point represents a token at a specific layer, and trajectories show how representations evolve with increasing depth. Tokens follow smooth, structured paths and occupy distinct regions of representation space, indicating progressive semantic organization and increasing geometric separability in intermediate layers.

Figure 3. Animated PCA projection of token representations across transformer layers for the sentence "John put the glass on the table. It broke." Token trajectories are relatively diffuse in early layers, undergo pronounced geometric reorganization in mid-layers, and stabilize in later layers. The increased mid-layer movement reflects a transition from surface-level encoding to structured semantic representation, consistent with the SituatiONION hypothesis. ---

Pipeline

Text Input
  ↓
Tokenization
  ↓
Transformer Forward Pass
  ↓
Hidden State Extraction
  ↓
Feature Dataset Construction
  ↓
Evaluation (Linear Probe)
  ↓
Visualization (PCA)

Method

Hidden State Extraction

Extract layer-wise representations using HuggingFace Transformers:

outputs = model(**inputs, output_hidden_states=True)
hidden_states = outputs.hidden_states

Produces tensor:

[num_samples, num_layers, hidden_dimension]

Linear Probe Evaluation

Train linear classifiers to measure semantic separability:

probe = LogisticRegression(max_iter=1000)
probe.fit(X_train, y_train)
accuracy = probe.score(X_test, y_test)

Higher accuracy → stronger semantic encoding.

Evaluated independently across all layers.

Visualization

Use PCA to inspect representation geometry:

pca = PCA(n_components=2)
X_reduced = pca.fit_transform(X)

Reveals clustering and structural evolution across layers.

Tech Stack

Python
PyTorch / HuggingFace Transformers
NumPy
scikit-learn
Matplotlib

Results Summary

Typical findings:

Early layers encode lexical features
Middle layers encode semantic structure
Later layers encode task-specific abstractions
Linear separability peaks in mid-to-late layers

Confirms progressive semantic organization in transformer representations.

Applications

Model interpretability
Representation evaluation
Feature extraction
Transformer analysis
Downstream ML feature engineering

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
figures		figures
CITATION.cff		CITATION.cff
LICENSE		LICENSE
README.md		README.md
SituatiONION.ipynb		SituatiONION.ipynb
SituatiONION1.ipynb		SituatiONION1.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧅 SituatiONION : Transformer Representation Analysis Pipeline

Experimental pipeline for extracting and evaluating hidden state representations from transformer language models to quantify how semantic information emerges across layers.

Overview

Key Results

Pipeline

Method

Hidden State Extraction

Linear Probe Evaluation

Visualization

Tech Stack

Results Summary

Applications

Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🧅 SituatiONION : Transformer Representation Analysis Pipeline

Experimental pipeline for extracting and evaluating hidden state representations from transformer language models to quantify how semantic information emerges across layers.

Overview

Key Results

Pipeline

Method

Hidden State Extraction

Linear Probe Evaluation

Visualization

Tech Stack

Results Summary

Applications

Author

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages