Interpreting and Steering Features in Images
A CLIP embedding is a dense vector of 1,280 numbers, none of which means anything on its own. A sparse autoencoder unpacks that vector into ~160k interpretable features, of which only a few dozen fire for any given image. Turn one up, turn another down, regenerate.
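To make the encode, edit, decode loop concrete, here is a minimal NumPy sketch. Everything in it is an assumption for illustration: the names (`W_enc`, `W_dec`, `encode`, `steer`) are hypothetical, the weights are random stand-ins for a trained autoencoder, sparsity is enforced with a simple top-k rule that may differ from the real model, and the dictionary is shrunk to 4,096 features so the example fits in memory.

```python
import numpy as np

D_EMBED = 1280     # CLIP embedding width, as described above
N_FEATURES = 4096  # toy dictionary size (assumption; the real one is ~160k)
TOP_K = 32         # assumed number of features allowed to fire per image

rng = np.random.default_rng(0)

# Random stand-in weights. A trained sparse autoencoder would be loaded here.
W_enc = (rng.standard_normal((D_EMBED, N_FEATURES)) / np.sqrt(D_EMBED)).astype(np.float32)
b_enc = np.zeros(N_FEATURES, dtype=np.float32)
W_dec = (rng.standard_normal((N_FEATURES, D_EMBED)) / np.sqrt(N_FEATURES)).astype(np.float32)
b_dec = np.zeros(D_EMBED, dtype=np.float32)


def encode(x, k=TOP_K):
    """Map a dense embedding to sparse feature activations (ReLU, then top-k)."""
    pre = np.maximum((x - b_dec) @ W_enc + b_enc, 0.0)
    keep = np.argpartition(pre, -k)[-k:]  # indices of the k largest activations
    acts = np.zeros_like(pre)
    acts[keep] = pre[keep]
    return acts


def decode(acts):
    """Reconstruct a dense embedding from feature activations."""
    return acts @ W_dec + b_dec


def steer(x, changes, k=TOP_K):
    """Set chosen features to new values and return the edited embedding.

    Only the decoded difference is added to the original vector, so anything
    the autoencoder fails to reconstruct is left untouched.
    """
    acts = encode(x, k)
    edited = acts.copy()
    for feature_id, value in changes.items():
        edited[feature_id] = value
    return x + (edited - acts) @ W_dec


# Stand-in for a real CLIP image embedding.
embedding = rng.standard_normal(D_EMBED).astype(np.float32)
acts = encode(embedding)

strongest = int(np.argmax(acts))          # feature to turn up
weakest_active = int(np.flatnonzero(acts)[0])  # an active feature to turn off
steered = steer(embedding, {strongest: acts[strongest] + 2.0, weakest_active: 0.0})
```

The steered vector has the same shape as the original embedding, so it can be handed to whatever image generator consumes CLIP embeddings for the regenerate step, which is not shown here. Adding only the decoded difference, rather than replacing the embedding with a full reconstruction, is one common design choice and not necessarily the one used in this work.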