7 Jun 2026/4 min read

Engineered, not prompted

Most AI demos work once. Production work has to survive the messy case, the edge case, and the Tuesday everyone forgets.

There is a gap between a demo that works on stage and a system that works on a Tuesday. The demo gets the clean input. Production gets the typo, the half-finished sentence, the customer who replies to the wrong thread.

Prompting gets you the demo. Engineering gets you the system: retrieval over real data, scoring against real criteria, an evaluation harness on real historical cases, and a fallback for when the model is unsure.

We build the second thing. Before a system is yours, it has to pass the cases that broke the last one. If it does not survive that, it does not ship.

<-All writing Start a build