Garrett Allen

Founding AI Engineer at LayerLens.ai. I build evaluation systems for LLMs and agents. Before that, I studied math at USC and played NCAA Division I men's water polo.

I'm interested in world models, machine agency, and agent-to-agent communication.

Recent Work

CodaLM SAE: Sparse Autoencoders for Sperm Whale Communication

Sparse autoencoders applied to a small whale language model. SAE features recover clan identity without supervision and transfer across time and oceans.

Paying for Signal: Randomized Counterfactual Credit

How do you pay people for information without making them gamble? A sketch.

Selected Projects

LayerLens.ai

Evaluation platform for LLMs. Benchmarks and observability.

microcontract

Infrastructure for agents to negotiate and sign contracts with each other.

JEPA-Image-World-Model

Learning world models from Minecraft gameplay using JEPA.

LLM-Games

Game environments for testing how well LLMs reason and plan.

negotiation-env

Simulation environment for training agents to negotiate.

Favorites

Papers: Platonic Representation Hypothesis, A Path Towards Autonomous Machine Intelligence, IWM-JEPA, MindEye 2, o1 System Card, DeepSeekMath, SimpleQA, Voyager, Entropy SGD, Scaling and Evaluating SAEs.

Posts: Dealing with Daemons through Algorithmic Contracts, The Model Does the Eval, Why Tool AIs Want to Be Agent AIs, Jailbreaking Frontier Models (PRBO), Clever Hans, ATPAMI: Unsolved Technical Alignment Problems.

Books: UDL, Topology, Baby Rudin, Arbitrage Theory in Continuous Time, Programming Massively Parallel Processors, Neural Networks and Numerical Analysis.

Repositories: Mineflayer, V-JEPA, Verifiers, TAU-Bench.