Library · 06

Field notes, essays, and further reading.

Everything we have published on specialist agents, the operating model, and the institutional discipline that makes them trustworthy.

We prefer to compete on substance. Nothing in this library is proprietary; the moat is in the operating system, not the essays. Read at your own pace, in any order.

Shelf / 01

Canonical chapters

5 min

The thesis

Five claims about specialist agents, read once.

Read→

18 min

The vision

The full argument. Market position, moat, and operating model.

Read→

12 min

The method

Seven pillars, in compounding order.

Read→

9 min

Modelsmith

The product. Evaluation-first specialisation engine.

Read→

8 min

The roadmap

Five phases from wedge to fleet.

Read→

Shelf / 02

Method

6 min read · essay

On eval design

Governed scenarios you approve. Expansion scenarios the loop proposes. The separation is what lets the autonomous iterate loop run without letting its own rubrics drift.

Read→

5 min read · essay

On latency

In programmatic advertising the decision window is measured in tens of milliseconds. Specialist SLMs change what is feasible; a generalist model that misses the budget is not slow, it is inoperative.

Read→

Shelf / 03

Operating model

5 min read · essay

On the fleet

A fleet of specialists implies a fleet of training hosts. The router is deterministic, not learned. The moat is the specialists, not the dispatch.

Read→

6 min read · essay

On promotion discipline

Six explicit states from candidate to production-accepted. Approval gates at the transitions that matter. Modelsmith supplies the artefact and the rollback contract; you control the shadow and the canary.

Read→

Shelf / 04

Field notes

4 min read · essay

On build vs buy

Most teams do not fail because they cannot run a fine-tune. They fail because they cannot institutionalise the closed eval–train loop and the promotion discipline around it.

Read→

4 min read · essay

The fork workflow

Platform improvements flow upstream. Customer-owned domain artefacts stay in the customer environment. The boundary is a deliberate design choice, not a legal formality.

Read→

/ archive

The archive grows with the platform. Each persistent failure in the iterate loop, and each decision general enough to be worth writing down, eventually ends up here.

Start an engagement→
