Taste Evaluator Blueprint

A build-ready blueprint for a generative-to-evaluator pipeline that scales human taste at machine speed: a taste corpus, an on/off-brand filter, the 1,000→100→10→1 funnel, and a human-in-the-loop calibration loop that prevents the Flattening Effect.

Curation funnel: Generated (1,000) → Shortlist (100) → Finalists → Shipped

What it does

When execution commoditizes, judgment becomes the scarce differentiator. Chapter 7 makes the case through Midjourney: competitors have the same diffusion architectures, much of it open source, yet none has reproduced Midjourney's output, because "the moat is taste — and taste is not something any open-source license can distribute." Chapter 9 names the person who holds that moat — the Force Multiplier — and the mechanism that scales her judgment: the evaluator agent. Figure 9.4 is the architecture in one line: a generative swarm produces 1,000 variations, a curated library narrows them, the Force Multiplier's filter keeps the top 100, and the best is deployed — "scaling human judgment at machine speed."

This blueprint is the build spec for that pipeline. It is not a scoring tool; it is an engineering brief for a system that operationalizes Taste as a Moat. The core insight from Chapter 9 is what the pipeline must respect: agents "detect violations of existing aesthetic rules with ruthless precision. Only humans sense when the rules themselves need revision." So the system generates and filters at machine scale, but it keeps a human in the loop at the exact point where the rules might need to change — and it actively defends against the Flattening Effect, the documented drift where a Force Multiplier's rejection rate falls from 50% in week one to under 10% by month six, not because she got better but because her taste converged with the agents'.

The pipeline has four stages — taste corpus, on/off-brand filter, the 1,000→100→10→1 funnel, and the human-in-the-loop calibration loop — and Sofia Marchetti's "staleness detector" from Chapter 9 is built in: a penalty for recommendations too similar to recent output, measured across silhouette, layering, palette, and accessory dimensions over rolling windows.
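One way to read the staleness detector is as a similarity score against recent output, averaged per dimension. The sketch below is a minimal illustration under stated assumptions: each item carries one categorical value per dimension, similarity on a dimension is the share of recent items with the same value, and the penalty is the mean across dimensions on a 0-100 scale. The function name, the data layout, the scoring rule, and the count-based window (rather than a window measured in weeks) are all hypothetical and not taken from Chapter 9.

```python
from collections import deque

# The four dimensions named in the text.
DIMENSIONS = ("silhouette", "layering", "palette", "accessory")

def staleness_penalty(candidate, recent, dimensions=DIMENSIONS):
    """Return a 0-100 penalty: the mean share of recent items that match
    the candidate on each dimension. Assumed scoring rule, for illustration."""
    if not recent:
        return 0.0  # nothing to be stale against
    shares = []
    for dim in dimensions:
        matches = sum(1 for item in recent if item[dim] == candidate[dim])
        shares.append(matches / len(recent))
    return 100.0 * sum(shares) / len(shares)

# Rolling window: a bounded deque drops the oldest output automatically.
window = deque(maxlen=3)
window.append({"silhouette": "boxy", "layering": "single",
               "palette": "earth", "accessory": "scarf"})
window.append({"silhouette": "boxy", "layering": "double",
               "palette": "earth", "accessory": "belt"})

candidate = {"silhouette": "boxy", "layering": "single",
             "palette": "neon", "accessory": "belt"}
penalty = staleness_penalty(candidate, window)
print(penalty)  # 50.0: full match on silhouette, half on layering and accessory, none on palette
```

A candidate that repeats every dimension of every recent item would score 100, and one that shares nothing would score 0.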

Who it's for: Force Multipliers and the Architects who build alongside them, in any AI-Born firm where output is commoditized and brand-distinctive taste is the durable advantage — fashion, design, content, product.

Figure: The four-stage evaluator architecture this blueprint specifies — generative swarm to curated library to Force Multiplier filter to deployed best.

Domain

Where output is commoditized and brand-distinctive taste is the durable advantage.

Stage 1 — Taste corpus

The Force Multiplier’s accumulated judgment, versioned. Borderline stays a distinct label.

Labels: On-brand, Off-brand, Borderline.

Corpus total: 240.

Brand dimensions (comma-separated)

The aesthetic axes Sofia’s staleness detector measures across.

Stage 3 — Funnel ratios

Generate N, curate to M, filter to K.

Default 1,000 → 100 → 10 → 1 (Figure 9.4). The human narrowing cannot be skipped.
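The funnel can be sketched as a ranking step followed by two human narrowings. This is a minimal sketch, not the blueprint's implementation: `run_funnel`, `score`, and `human_narrow` are hypothetical names, and it assumes the machine handles N→M while a human callable handles M→K and K→1. Passing no human callable raises an error, which mirrors the rule that the human narrowing cannot be skipped.

```python
def run_funnel(candidates, score, human_narrow, m=100, k=10):
    """Narrow N candidates to M by machine score, then to K and to 1 by a human.

    `score` maps a candidate to a number. `human_narrow(items, keep)` returns
    at most `keep` items chosen from `items`. Both are illustrative stand-ins.
    """
    if human_narrow is None:
        raise ValueError("the human narrowing step cannot be skipped")
    # Machine stage: rank all N candidates and keep the top M.
    shortlist = sorted(candidates, key=score, reverse=True)[:m]
    # Human stages: M -> K finalists, then K -> 1 shipped.
    finalists = human_narrow(shortlist, k)
    shipped = human_narrow(finalists, 1)
    if len(finalists) > k or len(shipped) != 1:
        raise ValueError("human narrowing returned the wrong number of items")
    return shortlist, finalists, shipped[0]

# Stand-ins: integer candidates, identity score, and a "human" who keeps
# the first items offered. A real reviewer would replace the lambda.
candidates = list(range(1000))
shortlist, finalists, shipped = run_funnel(
    candidates,
    score=lambda c: c,
    human_narrow=lambda items, keep: items[:keep],
)
print(len(shortlist), len(finalists), shipped)  # 100 10 999
```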

Exploration randomness: 30

Sofia's lever — temperature 0–1. Raise it to force the swarm out of a local maximum.

Anti-flattening guardrails

Staleness window (weeks)

Rejection-rate floor (%)

Default 2-week window; 12% floor (Chapter 9). Below the floor → recalibration gate.
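The recalibration gate reduces to a threshold check on the rejection rate. The sketch below assumes decisions are recorded as the strings "reject" and "accept" and that a rate exactly at the floor still passes, since the text only gates rates below it. The function names and the decision encoding are hypothetical.

```python
def rejection_rate(decisions):
    """Percentage (0-100) of human decisions that were rejections."""
    if not decisions:
        return 0.0
    rejected = sum(1 for d in decisions if d == "reject")
    return 100.0 * rejected / len(decisions)

def deploy_allowed(decisions, floor_pct=12.0):
    """Hold the deploy when the rejection rate falls below the floor."""
    return rejection_rate(decisions) >= floor_pct

week_one = ["reject"] * 5 + ["accept"] * 5     # 50% rejected
month_six = ["reject"] * 1 + ["accept"] * 19   # 5% rejected

print(deploy_allowed(week_one))   # True
print(deploy_allowed(month_six))  # False: below the 12% floor, deploy is held
```

With no recorded decisions the rate is treated as 0%, so the gate holds the deploy until a reviewer has actually weighed in.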

Escalation rules (route to human regardless of score)

Funnel simulation

Recent-output similarity (rolling window): 35

Drives the staleness penalty. Push it high to watch the funnel converge on a local maximum.

Run seed (deterministic)

Same seed + corpus version reproduces identical evaluator scores — audits replay exactly.
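Reproducibility of this kind usually comes from deriving every random draw from the seed and the corpus version together. The following is a minimal sketch of that idea with placeholder scores; `evaluator_scores`, the hashing scheme, and the version strings are assumptions for illustration, not the blueprint's actual scoring code.

```python
import hashlib
import random

def evaluator_scores(candidate_ids, seed, corpus_version):
    """Return reproducible placeholder scores (0-100) keyed by candidate id."""
    # Hash seed and corpus version together so changing either one
    # changes the random stream.
    digest = hashlib.sha256(f"{seed}:{corpus_version}".encode()).hexdigest()
    rng = random.Random(int(digest, 16))  # private generator, no global state
    return {cid: rng.randint(0, 100) for cid in candidate_ids}

first = evaluator_scores(range(10), seed=42, corpus_version="v1")
replay = evaluator_scores(range(10), seed=42, corpus_version="v1")
print(first == replay)  # True: same seed and corpus version replay exactly
```

Using a private `random.Random` instance rather than the module-level functions keeps the audit replay independent of any other code that draws random numbers.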

1000 → 100 → 10 → 1 · funnel run · corpus ve18a4f

Deploy held

Rejection rate below the flattening floor — recalibration required before the next deploy.

Drift dashboard

Rejection rate (floor 12%): 0%

Mean batch staleness: 29

Shortlist escalated to human (0/10): 0%

Flattening alert · deploy blocked

Rejection rate has fallen below 12%. This is not skill improvement — it’s taste convergence with the agents. Rotate through non-AI environments and recalibrate before the distinctiveness erodes invisibly. The next deploy is gated until recalibration.

Evaluator scorecard · shortlisted 10 (top 10)

#	On-brand	Staleness	Net	Route
961	85	31	54	auto
106	82	30	52	auto
610	82	30	52	auto
580	81	30	51	auto
119	81	30	51	auto
534	80	30	51	auto
23	80	30	51	auto
995	80	30	51	auto
886	80	30	51	auto
35	79	29	50	auto

The human-in-the-loop narrowing at 100→10→1 is mandatory — the evaluator detects rule violations; only the human senses when the rules themselves must change. Identical seed and corpus version reproduce this table exactly.
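The scorecard columns can be read as a net score plus a routing decision. The sketch below assumes Net is the on-brand score minus the staleness penalty, which matches most rows of the table (a few differ by one point, presumably from rounding the displayed values). The escalation rule, the `label` field, and the second row are hypothetical and exist only to show a candidate being routed to a human regardless of its score.

```python
def net_score(on_brand, staleness):
    """Assumed rule: on-brand score minus staleness penalty."""
    return on_brand - staleness

def route(row, escalation_rules):
    """Return 'human' if any rule matches the row, whatever its score; else 'auto'."""
    return "human" if any(rule(row) for rule in escalation_rules) else "auto"

# Hypothetical rule: escalate anything the filter labeled borderline.
rules = [lambda row: row.get("label") == "borderline"]

rows = [
    # First row uses the scorecard values for candidate 961; its label is assumed.
    {"id": 961, "on_brand": 85, "staleness": 31, "label": "on-brand"},
    # Invented row: a high net score that is still escalated.
    {"id": "hypothetical", "on_brand": 90, "staleness": 10, "label": "borderline"},
]
results = [
    (r["id"], net_score(r["on_brand"], r["staleness"]), route(r, rules))
    for r in rows
]
print(results)  # [(961, 54, 'auto'), ('hypothetical', 80, 'human')]
```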

Operationalizes the Taste as a Moat framework.

From the books

Book 1, Chapter 9 — "The Force Multiplier: Taste-Transmitter and Player-Coach" (the Taste-as-Filter evaluator architecture, Sofia Marchetti's staleness detector, and the Flattening Effect mitigations).
Book 1, Chapter 7 — "The Invisible Moat" (Taste as a Moat; the Midjourney example).
