Draft · 2026   Draft

Why active learning is the key to physical AI

The most expensive thing a learning system can do is collect the wrong data. We don’t usually talk about it that way, because for the last decade the dominant strategy has been to collect everything and let scale sort it out. That worked astonishingly well. It will not keep working, and the reason is the thing I spent my doctorate on: uncertainty.

Active learning is the simple idea that a model should choose what to learn next. Instead of being handed a fixed dataset, it looks at the world, asks where it is most uncertain, and spends its limited budget of attention (queries, labels, experiments, real-world trials) exactly there. To do that well, a model has to be honest about what it doesn’t know. That turns out to be the hard part. Most deep networks are confidently wrong; calibrating their uncertainty is a real research problem, and it’s the one I kept circling back to.

For physical AI, this isn’t academic. A robot cannot scrape a trillion tokens of experience overnight. Every trial costs time, hardware, and risk. The systems that win in the physical world will be the ones that extract the most learning from the fewest interactions. They are the ones that know where their own understanding is thinnest and go probe it. Uncertainty estimation is what tells an embodied agent when to act and when to gather more information first. It is also, not incidentally, a safety property: a system that knows it doesn’t know can ask for help instead of charging ahead.

The same logic is starting to bite at the other end of the field. Large language models are running into diminishing returns, not because there’s no data left, but because the marginal token is mostly redundant with what the model already knows. Pouring in more of the same buys less and less. The way out isn’t a bigger pile; it’s selection. A model that can identify the regions where it is genuinely uncertain, and direct data collection, synthetic generation, or human feedback toward exactly those regions, gets far more out of every additional example. Active learning is how you turn a brute-force curve back into an exponential one.

Knowing what you don’t know is not a footnote to intelligence. It’s the steering wheel.

That’s why I keep coming back to it. Whether the system has a body or lives on a server, the binding constraint is the same: not how much it can absorb, but how well it can choose. Get that right and physical AI becomes tractable and language models get a second wind. Get it wrong and you spend forever collecting the wrong data.

← Back to writing