Intro

The Scaling Trap

2 min read

The most common scaling trap in service businesses is this: revenue grows, but so does the founder's working hours — at the same rate. The agency earns more but the founder's life doesn't improve. The business is still entirely dependent on one person's judgment, time, and presence.

AI agencies have a structural advantage here. AI can absorb the repetitive, high-volume parts of service delivery — the parts that, in a traditional agency, would require a full-time hire at $50K–$80K per year. This creates a genuine decision framework: for each function that needs to scale, is the bottleneck a repetitive process (AI handles it), a judgment call (human required), or an accountability obligation (human required, and their name needs to be on it)?

The skill of this module is systems evaluation — the O*NET competency of identifying the components of a system, understanding how they interact, and deciding how to change one component without breaking the others. Applied to agency scaling, this means knowing exactly what each hire or automation decision changes, and what it leaves the same.

Portfolio artifact — Skill

A hire-vs-AI decision matrix for your agency: evaluated functions, decision criteria applied, recommendations with cost and governance rationale, and a six-month scaling roadmap.

By the end of this module, you will:

Apply the three-test decision framework to agency functions that need to scale
Distinguish repetitive processes from judgment calls from accountability obligations
Calculate the break-even point between automation investment and headcount cost
Use O*NET systems evaluation principles to identify where a change in one function affects others
Build a six-month scaling roadmap with hire vs. AI decisions justified

Scenario

The $100K Decision

3 min read

An AI content agency reaches $100,000 in annual revenue. The founder works roughly 40 hours per week: 15 hours on delivery, 10 hours on client management, 8 hours on sales, 4 hours on administration, and 3 hours on strategy and improvement. The business is sustainable, but growth requires more capacity. The founder needs to add approximately 20 more hours per week to the system to take on additional clients.

Two paths exist. The first: hire a part-time contractor at $25–$35 per hour. For 20 hours per week, that's approximately $26,000–$36,000 per year. The contractor handles delivery tasks — content production, intake processing, brief preparation. The founder supervises but reduces direct delivery hours. The risk: the contractor requires training, supervision, and quality oversight. Adding headcount adds management overhead.

The second: invest in better automation. The founder currently spends 6 hours per week on intake and brief preparation that could be 80% automated (Module 5). Automating these frees 5 hours immediately. An additional 3 hours could be reclaimed from better delivery tooling. This gets to 8 additional hours without adding headcount. The remaining 12 hours needed for growth requires a hire — but a smaller one, 12 hours per week, $16,000–$20,000 per year.

The difference between Path 1 and Path 2 is not just cost. It's what each path makes the agency capable of. Path 1 scales by adding human capacity. Path 2 scales by making existing capacity more efficient, then adds targeted human capacity only where automation cannot substitute. Path 2 is more fragile in month one and more resilient in month six.

The decision matrix is the tool for making this choice explicitly, not intuitively.

Lesson

The Three-Test Decision Framework

3 min read

Every function in an agency can be evaluated against three tests. The tests determine whether the function can be automated, requires a human, or requires a human with named accountability. Getting this wrong in either direction — automating what needs human judgment, or hiring for what automation handles better — wastes money and creates operational risk.

Test 1: The Repetition Test

Does this function follow a predictable pattern that repeats across clients or projects? If yes, it's a candidate for automation. The key qualifier: predictable. A function that looks repetitive but varies significantly in ways that affect output quality is not actually a repetition candidate — it's a judgment call wearing repetitive clothing. Intake form processing is a genuine repetition candidate. Client strategy recommendations are not, even though they follow a general structure.

Test 2: The Judgment Test

Does this function require evaluating novel situations where the right answer depends on context that isn't captured in a template? Judgment calls require humans. The specific question is: would a competent, trained AI consistently produce the right answer for this function? If the answer is "most of the time, but the exceptions matter," the function requires human oversight — at minimum, human review of AI output at the exceptions.

Test 3: The Accountability Test

Does this function require a named person to be accountable for the outcome? Not "someone reviewed it" — a specific person whose professional judgment is behind the output. Gate 3 human review, client-facing strategy recommendations, and contract negotiations all require named accountability. AI cannot be the accountable party — it has no professional standing, no liability, and no recourse if it's wrong. Any function where "who is responsible if this is wrong?" has to have a human name in the answer requires a human in the role.

Governance Standards — The Regulatory Layer

O*NET — Systems Evaluation (6.A.1.b)

The O*NET Systems Evaluation competency requires identifying measures or indicators of system performance and the actions needed to improve or correct performance relative to the goals of the system. Applied to agency scaling: you're evaluating your delivery system's performance (time per client, quality per output, founder hours per revenue dollar) and identifying which changes produce the most improvement without creating new failure modes. This is not just about cost — it's about what the system can and cannot do reliably.

O*NET — Critical Thinking (4.A.4)

The O*NET Critical Thinking competency — using logic and reasoning to identify the strengths and weaknesses of alternative solutions — is directly applicable to hire vs. AI decisions. The matrix is a critical thinking tool: it forces you to state the assumptions behind each decision, the failure modes of each alternative, and the criteria by which you'd evaluate whether the decision was right six months later. This is how strategic capacity decisions are made rigorously, not intuitively.

Context

Three Things the Matrix Must Surface

2 min read

A decision matrix that doesn't surface these three things will give you a recommendation but not the confidence to act on it.

1. The governance implication of each decision

Every hire vs. AI decision has a governance consequence. Automating the intake brief means the brief is now AI-generated — which means Gate 3 review must be stronger, because the brief drives all downstream AI outputs. Hiring a contractor for delivery means you need an accountability structure — who reviews their work, what criteria, what escalation? The matrix should name the governance change that each decision creates, not just the cost difference.

2. The break-even point

Automation has upfront cost — tooling, setup, your time. Headcount has recurring cost. The break-even point is how many months until the automation investment pays for itself compared to the headcount alternative. For a $5,000 automation setup that saves 5 hours per week versus a $25/hour contractor, break-even is approximately 8 weeks. At 9 months, the automation has saved the equivalent of a full-time month. Know the number before you make the decision.

3. The failure mode of each option

Automation fails silently and degrades over time. Headcount fails visibly and immediately. The question is not which option has fewer failure modes — both fail. The question is which failure mode you can recover from faster in your context. An agency with strong monitoring (NIST MEASURE from Module 5) can catch automation degradation early. An agency with good onboarding systems (Module 4) can recover from a contractor departure faster. Your recovery capacity should inform the decision.

You'll apply all three in the lab — building a hire vs. AI decision matrix for your agency's current scaling bottleneck.

◆ Skill Lab

Hire vs. AI Matrix

~20 minutes · 3-5 functions

What you're doing

You'll apply the three-test framework to 3-5 functions in your agency that need to scale. I'll push on governance implications, break-even math, and failure modes. You'll leave with a recommendation and a six-month scaling roadmap.

Roles

👤

You — Agency PrincipalYou're making capacity decisions. Use your actual agency or the $100K scenario from the Scenario page.

📊

AI — Systems EvaluatorI'll apply O*NET systems evaluation criteria, push on the break-even math, and flag governance implications you've missed.

Three-test framework

Repetition Test — predictable pattern, AI candidate

Judgment Test — novel context required, human required

Accountability Test — named person required, human required

Break-even — automation cost vs. headcount recurring cost

Governance implication — what changes in the delivery system

Failure mode — how each option breaks, and recovery speed

Success criteria

A decision matrix with recommendations for 3-5 functions, with break-even analysis and governance implications for each — plus a six-month roadmap.

Shift + Enter for a new line

✓ Module Complete

You've completed Module 7 of 8. Your decision matrix is your seventh portfolio artifact.

Next Module →