Anthrophic Natural Langauge Autoencoders

Updated 16 May 2026

Interesting technique.

Take a activation vector, use words as the latent space, and map back to activation vector. Figure out what model is thinkining internally.