Intro

How the Work Actually Gets Done

2 min read

Most AI agencies fail at delivery, not at sales. They close clients, start projects, and then improvise the production process every single time. Each project runs differently. Quality is inconsistent. Timelines slip because no one documented how long each step actually takes. Client expectations are managed through heroic individual effort rather than a repeatable system.

A delivery system is what turns an agency from a freelancer with multiple clients into a business. It defines exactly how a project moves from client intake to final delivery — what happens at each step, who is responsible, what quality checks occur, and what happens when something goes wrong. For an AI agency, the delivery system also defines where AI is in the process, where human judgment takes over, and how outputs are reviewed before they reach the client.

The NIST AI RMF MANAGE function exists specifically for this: managing AI risk in operational deployment. Delivery SOPs are how that management happens in practice. Without them, governance is aspirational. With them, it's operational.

Portfolio artifact — Build

A delivery SOP for your core service tier: step-by-step from client intake through AI workflow, human review gate, and final handoff — with named accountability at each stage and an escalation path for output failures.

By the end of this module, you will:

Map a complete delivery workflow from intake to handoff for your core service
Identify where AI is used and where human judgment is required at each stage
Design a human review gate with specific quality criteria
Document an escalation path for when AI output fails quality standards
Apply NIST MANAGE principles to make your delivery system a governance document

Scenario

The Growth Chaos Problem

3 min read

An AI content agency grows from two clients to twelve in eight months. The founder handles all client work personally for the first six months. Projects run smoothly — because the founder knows every client's preferences, checks every output intuitively, and escalates to themselves when something seems off.

At twelve clients, the founder brings on a part-time contractor. The contractor is competent. But they don't know what "good" looks like for each client. They don't know when to flag an AI output for human review versus when to send it directly. They don't know what the founder checks before delivery. Within thirty days, two clients flag quality issues that the founder would have caught. One client cancels.

The founder realizes the problem: the delivery system exists only in their head. The quality checks are intuitive. The escalation path is "text the founder." There's no documentation of what AI handles, what the contractor reviews, or what the founder approves. Every new person on the team has to learn by watching, and learning by watching doesn't scale.

The fix is a delivery SOP — a documented workflow that any trained contractor can execute, with defined checkpoints, explicit quality criteria, and a named escalation path. When built correctly, the SOP also becomes the agency's governance document: proof that AI outputs are reviewed, that human judgment is applied at defined points, and that there is accountability for what reaches the client.

Without the SOP, growth creates chaos. With it, growth creates capacity.

Lesson

The Five-Gate Delivery Model

3 min read

A delivery SOP built around five gates ensures that every project passes through the same checkpoints, in the same order, with the same accountability at each step. The gates are not bureaucracy — they are the minimum viable structure that prevents quality failures from reaching clients.

Gate 1: Intake

Client brief is collected, clarified, and confirmed before any AI work begins. The intake gate prevents the most common failure mode in AI content work: garbage-in garbage-out. If the brief is vague, the AI output will be vague. The intake gate is where scope is confirmed and ambiguities are resolved.

Gate 2: AI Production

The AI workflow runs. Prompts are standardized for the niche. Outputs are generated. This gate is fully AI-executed — no human judgment required except to run the workflow correctly. The output of this gate is a draft, not a deliverable.

Gate 3: Human Review

A human reviews the AI draft against specific quality criteria. Not "does this look good?" — that's not a criterion. The criteria are defined per service type: factual accuracy, brand voice compliance, structural completeness, regulatory language if applicable. This is where governance happens. The reviewer either approves the draft or sends it back to Gate 2 with specific revision instructions.

Gate 4: Client Handoff

The approved output is packaged and delivered. The handoff includes any required documentation — version number, review timestamp, the reviewer's name. For regulated industries, handoff documentation is part of the compliance record.

Gate 5: Feedback Loop

Client feedback is captured and used to improve the intake brief and AI prompts for the next cycle. The feedback gate prevents the same quality issues from recurring.

Governance Standards — The Regulatory Layer

NIST AI RMF — MANAGE Function

The MANAGE function requires that identified AI risks be treated through ongoing monitoring, response plans, and documentation. A five-gate delivery SOP is the operational implementation of MANAGE: Gate 3 (human review) is the monitoring checkpoint, the escalation path is the response plan, and handoff documentation is the audit trail. An agency that can show a client their delivery SOP has operationalized NIST MANAGE — not just described it.

EU AI Act — Article 14: Human Oversight

Article 14 requires that high-risk AI systems are designed to enable appropriate human oversight during deployment. For AI agencies, Gate 3 is the human oversight mechanism. It must be real — not a rubber stamp. The quality criteria must be specific enough that a reviewer would catch the failures the AI is likely to produce in this niche. If Gate 3 is "skim it and send," Article 14's requirement for effective human oversight is not met.

Context

Three Things a SOP Must Specify

2 min read

A delivery SOP that doesn't specify these three things will fail under operational pressure. When a contractor is tired or rushed, vague SOPs get skipped. Specific SOPs get followed.

1. Who is responsible at each gate

Not a role category — a specific role or name. "Someone reviews this" is not a SOP. "The lead reviewer approves Gate 3 before any output leaves the system" is a SOP. Accountability requires specificity. When something goes wrong, the first question is: who was responsible at the gate where the failure occurred? If your SOP can't answer that, the accountability structure doesn't exist.

2. What "pass" looks like at Gate 3

The human review gate needs explicit pass criteria, not a general quality standard. For a B2B content agency: "All facts verified against source. Brand voice checklist complete. No claims not in the client brief. No competitor references." A reviewer applying these criteria will catch the failures that matter. A reviewer applying "looks good" will not. The criteria are derived from your NIST MAP analysis — the failure modes you identified in Module 2.

3. The escalation path when Gate 3 fails

What happens when a reviewer finds a Gate 3 failure? Back to Gate 2 with revision instructions — who writes those instructions, and what format do they use? Does the failure get logged? Does the client get notified of a delay? Is there a threshold (e.g., three Gate 3 failures on the same project) that triggers a different response? The escalation path is what separates a governance document from a checklist.

You'll apply all three in the lab — building a complete delivery SOP for your core service tier with named accountability, specific Gate 3 criteria, and a documented escalation path.

◆ Build Lab

Delivery SOP

~25 minutes · 5 gates

What you're doing

You'll document your delivery SOP across five gates. I'll ask for specifics at each gate — especially Gate 3 — and push back on anything too vague to execute. By the end, you'll have a SOP you could hand to a contractor on day one.

Roles

👤

You — Operations DesignerYou're documenting the delivery system for your agency. Use your core tier from Module 2, or a representative service.

🔍

AI — SOP ReviewerI'll flag vague accountability, missing pass criteria, and gaps in the escalation path. I'm simulating a contractor who has to follow this document on day one.

Five-gate framework

Gate 1: Intake — brief collected, scope confirmed

Gate 2: AI Production — standardized workflow runs

Gate 3: Human Review — specific pass criteria, named reviewer

Gate 4: Client Handoff — packaging, documentation

Gate 5: Feedback Loop — input for next cycle

NIST MANAGE — escalation path for Gate 3 failures

EU AI Act Art.14 — human oversight must be real, not nominal

Success criteria

A SOP a contractor could execute on day one: named accountability, specific Gate 3 criteria, and a documented escalation path. No vague steps.

Shift + Enter for a new line

✓ Module Complete

You've completed Module 4 of 8. Your delivery SOP is your fourth portfolio artifact.

Next Module →