Tagged generative-art
2 essays
- Mechanistic interpretability as generative art If a network has learned a concept, that concept is a location in its embedding space. Steer a generator toward that location and you don't get a diagram of what the model knows - you get a picture of it.
- Measuring the platonic representation The platonic representation hypothesis says capable models converge toward one shared picture of reality. A language-anchored multimodal encoder lets you test a sharp version of it: encode one concept through four senses and measure what they agree on.