Field Note · 2026.04
The real bottleneck is usually not engineering capacity.
What a real deployment taught us about backlog anxiety, tribal knowledge, and why understanding compounds faster than output.
What teams describe as a throughput problem is often illegibility in disguise. We deployed the system on a complex monorepo and saw 67 effort points moved to ready-for-review in week one — roughly the entire engineering organization's typical weekly output.
The interesting result wasn't the throughput. It was where the bottleneck moved next: to QA downstream and product specification upstream. Engineering drag had been masking constraints that only surfaced once engineering stopped absorbing them.
The lesson — accumulated across this and a second support-oriented deployment — is that what people are usually buying when they ask for AI engineering capacity is something more fundamental: certainty, legibility, and continuity.
From throughput framing to what we saw in practice.
The backlog is growing. Senior engineers are overloaded. Support escalations take too long. Architecture questions keep pulling the same people into the same conversations.
The natural conclusion is that the team needs more capacity. Sometimes that is true. But often it is incomplete.
What teams experience as a capacity problem is frequently an understanding problem in disguise.
We deployed the system with a bootstrapped team of 10+ engineers working in a complex monorepo, with the usual concentration of knowledge in a few senior people. For context, the broader engineering organization averaged roughly 136 effort points per two-week sprint, about 68 points per week. The obvious expectation was more output. That happened.
67 effort points to ready-for-review in week one.
In one week, smpl alone roughly matched the entire organization's typical weekly engineering output. But the numbers were not the story. What mattered more was what the system was doing underneath the output: reading tickets before engineers touched them, tracing relevant code paths, surfacing ambiguity, asking pushback questions, and producing reusable understanding instead of disposable work.
The surprising result, and where complexity actually shows up.
Leadership did not slow expansion because the system was underperforming. They slowed expansion because it exposed a deeper organizational reality. Once engineering throughput increased, the visible bottleneck moved. Engineering was suddenly being squeezed on both sides — QA couldn't keep up downstream, and product specification couldn't define tickets deeply enough upstream. Work that had previously looked like an engineering throughput problem turned out to be a systems-legibility problem.
That is the real lesson. More output is useful, but clarity about the actual system is often more valuable than the extra code itself. Leaders often assume the cost of complexity shows up only in delivery speed. In reality it shows up everywhere:
- Tickets arrive underspecified because nobody has enough shared context.
- Investigations restart from zero because prior knowledge is not preserved.
- Support escalations require the same senior people to reconstruct the same history.
- Onboarding takes too long because understanding is socially distributed rather than operationally captured.
- Planning quality suffers because no one has a legible picture of what the codebase actually implies.
Teams interpret the pain as "we need more engineers." Often what they really need is a way to make the system more visible, more investigable, and less dependent on who happens to remember it.
A second proof, and what people are really buying.
We saw the same pattern in a support-oriented deployment. The system investigated escalations, produced a root-cause analysis in about 15 minutes, and created documents that made the subsequent conversation faster and better informed. The immediate win was speed. The deeper win was continuity.
Each investigation made the next one better. Patterns accumulated. Knowledge stopped disappearing after the thread went cold. The value was not just faster answers — it was institutional memory that compounded.
People say they want AI engineering capacity. Sometimes they do. But underneath that request, they are usually buying something more fundamental:
- Less dependence on key individuals.
- Faster confidence on tickets and incidents.
- A more legible codebase.
- Continuity when people leave.
- A way to scale understanding before scaling headcount.
In other words, they are buying certainty, legibility, and continuity.
Findings, and the practical implication.
- The bottleneck is often hidden by the current level of drag. If engineering is slow enough, every other problem stays masked behind it.
- Investigation matters as much as execution. Evaluating work before it starts changes throughput more reliably than just increasing code generation.
- Accumulated understanding is the real compounding asset. Tools that start from zero each time do not create institutional memory. Systems that preserve and deepen context do.
- The strongest story is not "AI engineer." It is persistent codebase intelligence that can extend into investigation and execution.
If you are leading an engineering team with a growing or aging codebase, the first question is not just "how do we move faster?" It is:
How much of our velocity problem is really a legibility problem?
If the answer is "more than we'd like to admit," the highest-leverage move is not another typing accelerator. It is a system that maps the codebase, investigates issues before engineers lose time, and preserves understanding as an operational asset. That is the shift. Not from humans to AI. From tribal knowledge to persistent intelligence.
See what this looks like on your codebase.
The Codebase Intelligence Review is a written assessment of where legibility, context loss, and structural drag are most expensive in your system. Selective; reviewed manually.