AESOP AI Academy · Internal Documentation

Version 2 Course Development:
Design Decisions Explained

How v2 courses are built differently, why the changes matter, and how the platform supports hands-on learning, responsible AI use, and age-appropriate development across levels.

Section 1

How v2 Course Development Differs from v1

The v1 and v2 systems were built for different jobs. V1 establishes foundational AI literacy — it's text-heavy, quiz-driven, and hub-delivered, suited to learners encountering AI concepts for the first time. V2 is built around doing: every module is anchored to a real lab, every lab produces something a learner can keep, and governance is woven into the content itself rather than treated as a separate unit.

The differences run through every layer — architecture, pedagogy, standards integration, and how AI is used inside the learning experience.

Dimension v1 (Electives / Foundations) v2 (2.0 Courses)
File format Content fragment — no <head>, no topbar. The hub (electives-hub.html) provides all chrome and extracts <style>, .tab-strip, and .content-area. Standalone full HTML page. No hub dependency. Every module is a self-contained file with its own topbar, tabstrip, CSS, and JavaScript.
Navigation model Hub-driven via ?lesson= URL parameter. Learner navigates lessons, quizzes, and labs within the hub's sidebar. Fixed topbar + 5-tab tabstrip (Intro → Scenario → Lesson → Context → Lab). Chrome is 92px total. Pages are fixed-position below.
Module structure 4 lessons × (lesson + quiz + lab) = 12 sections + module test. Content is linear, quiz-gated. 5 tabs per module: Intro, Scenario, Lesson, Context, Lab. One primary lab per module. No quiz — the lab is the assessment.
Lab type Single format: AI-driven conversation with a fixed system prompt. Progress measured by exchange count. Three types with different roles, thresholds, and artifacts — Debate (threshold 6), Skill (threshold 5), Build (threshold 4). Distribution is 2 Debate + 3 Skill + 3 Build per course.
Portfolio artifacts Implicit. Lab conversations may produce useful output, but there's no named artifact and no structured handoff to the learner. Every lab has a named artifact with a specific description. The ARTIFACT_DESC is included in the labComplete postMessage. Students know exactly what they produced.
Scenario design Story scenes with named fictional characters (e.g., "Marco looked at the screen and…"). Character-driven narrative. Real-world situations with no fictional protagonists. The learner is placed in the situation directly. Scenarios open with the problem, not a character description.
Governance integration Governance topics appear in lesson content as standalone concepts (e.g., a lesson on the EU AI Act). Standards are taught, not applied. Each module is assigned a primary governance standard (NIST, EU AI Act, UNESCO, ONET). The standard is named in the Lesson, referenced in the ACADEMY_GUARDRAIL, embedded in LAB_SYSTEM, and must appear in at least one FALLBACK.
Standards accountability No formal tracking. Module content may reference standards but there's no audit structure. COVERAGE matrix + EVIDENCE data in courses-v2.html. Every standard × component is rated (primary / partial / touched / none) with cited module, section, and exact quote as evidence.
Offline resilience If the API is unavailable, labs may fail silently or stall. Every module has 5–6 FALLBACKS. After one failed retry, the lab switches to offline mode automatically. Lab completion still fires. At least one fallback references the primary governance standard.
AI model Varies by module. Some modules specify a model, others defer to the proxy default. Standardized: claude-haiku-4-5-20251001 via /aesop-api/proxy.php. Consistent across all v2 modules.
Completion signal postMessage with type: 'labComplete' and exchange count. Same base structure plus artifactType and artifactDesc — the system records not just that the lab was finished, but what the learner produced.
The core shift: V1 teaches you about AI. V2 teaches you to do things with AI — and holds you accountable to a governance framework while you do it.

Section 2

Three Design Questions

Question 1
How do we support hands-on learning?

Every v2 module ends in a lab. There are no theory-only modules — if you can't apply it, you haven't finished it. The lab is not a quiz with right/wrong answers. It's a conversation with an AI that has a specific role, specific expertise, and specific instructions to challenge you until you've produced something worth keeping.

Three lab types create three different kinds of doing:

Debate Lab
Defend a Position
The AI takes the other side. You state a position, the AI challenges it with real technical and ethical pressure, and you either hold or revise under scrutiny. Threshold: 6 exchanges.
Skill Lab
Apply a Technique
The AI acts as a technical advisor who won't accept vague answers. You apply a specific framework — NIST MAP, chunking specs, vault design — to a real case you bring. Threshold: 5 exchanges.
Build Lab
Produce an Artifact
The AI acts as a rigorous reviewer. You design and document something real — a pipeline, a governance policy, a production spec — and the AI tests each section for completeness. Threshold: 4 exchanges.

Portfolio artifacts are real outputs. Each lab has a named artifact with a specific description. A debate lab produces a written defense across multiple scenarios. A skill lab produces a completed framework applied to a live case. A build lab produces a document the learner can deploy, ship, or show. The artifact description is included in the completion signal — not just "lab finished" but "what was built."

Completion is measured by engagement depth, not time. The exchange thresholds (4–6) are calibrated so that a learner who gives shallow answers hits friction — the AI requires specificity before moving forward. A learner who engages seriously completes faster. There's no clock, and there's no minimum word count. Quality pressure comes from the AI's role, not from the system.

Question 2
How do we support responsible AI use?

V2 treats responsible AI use as a structural requirement, not a topic. It appears in four places in every module — in the lesson content, in the lab guardrail, in the lab system prompt, and in the offline fallbacks. It can't be skipped because it's embedded in the mechanics of the lab itself.

The ACADEMY_GUARDRAIL is defined once per module and applied in every API call. It names the primary governance standard for that module and restricts the AI to staying on topic. The guardrail isn't a disclaimer — it's operational. It shapes every response the lab AI gives.

Governance standards are assigned, not optional. Each of the 8 modules in a course is assigned a primary governance standard from the six frameworks the platform tracks: EU AI Act, NIST AI RMF, UNESCO AI Recommendation, O*NET, AI4K12, and CSTA. That standard must be named in the lesson, applied in the lab, and cited in the COVERAGE/EVIDENCE matrix. If a module claims a standard as "primary," evidence must exist — a specific section, a specific quote.

The COVERAGE/EVIDENCE matrix holds the platform accountable. Every course has a per-standard, per-component coverage rating. Every rating above "touched" requires an evidence citation: module number, section name, and a quoted excerpt from the actual content. This is not a marketing claim — it's an auditable record of what the platform actually teaches.

Students learn to govern AI, not just use it. The debate labs specifically put governance pressure at the center of the lab. In Module 7 of the Agents course, the AI plays the role of a regulatory examiner applying EU AI Act Articles 9, 14, and 22 to deployment proposals the learner defends. The learner isn't graded on whether they get the right answer — they're evaluated on whether they can engage with governance pressure at all.

The AI model is conservative by design. V2 uses claude-haiku-4-5-20251001 — a fast, capable model appropriate for educational conversation at scale. It's not the most powerful model available, and that's intentional. Lab AI is an interlocutor, not an authority. Students should be forming their own positions, not deferring to the model's.

Question 3
How do we support age-appropriate development across levels?

The platform currently serves two distinct audiences across its two versions, with different design priorities for each. V1 and V2 are not competitors — they are different points on a developmental arc.

V1 is the entry point. Foundation courses are designed for general adult and youth learners encountering AI concepts for the first time. Content uses accessible language, story-driven scenarios with named characters, short quiz questions, and manageable lab exchanges. The hub chrome provides consistent reading controls — font size, contrast, dark mode — because the learner's comfort with the interface matters as much as the content. V1 doesn't assume technical background.

V2 is for practitioners. V2 courses assume learners who are already using AI — developers, analysts, educators, professionals making real decisions about AI deployment. Scenarios don't use fictional characters because practitioners don't need narrative scaffolding; they need to see themselves in the situation directly. Labs use real framework names (NIST MAP, EU AI Act Art.13) because the learner will encounter those names in real work.

Age-appropriate isn't only about vocabulary — it's about cognitive load and stakes. For younger or earlier-stage learners, the platform manages load by breaking content into small, sequenced chunks with immediate feedback (quiz, lab, test). For advanced learners, the platform creates productive difficulty — the lab AI won't accept vague answers, won't skip to the next question, and will push back on positions that aren't supported with evidence. That pressure is appropriate for the level.

Responsible AI content is calibrated by audience. A v1 module on AI ethics asks: should this AI be used here at all? A v2 module on AI governance asks: who owns this component, what's the failure response plan, and which EU AI Act article applies? The underlying values are the same — human oversight, fairness, accountability — but the specificity and stakes differ. Younger learners build intuitions; practitioners build compliance-ready frameworks.

Cross-level design principles that apply to both:

  • No account required, no paywall, no tracking — both versions are free and open
  • Labs are completable offline — the fallback system fires after one retry, so a lab never stalls because the API is down
  • Content avoids AI hype in both directions — neither "AI will solve everything" nor "AI will end everything"; every module grounds the learner in real decisions with real tradeoffs
  • Governance is taught as a skill, not a warning — students learn to apply frameworks, not just memorize them

What's next: Future v2 courses may introduce explicit audience targeting — modules designed for K–12 educators, for healthcare workers, for policy staff. The standards alignment infrastructure (AI4K12 for K–12, CSTA for computing educators, O*NET for workforce) is already built. The COVERAGE matrix can track which courses serve which audiences at which depth. The architecture is ready for differentiation; it's a content decision, not an engineering one.


Section 3

V2 Platform by the Numbers

4
v2 courses live
32
modules across all courses
8
debate labs per platform
12
skill labs per platform
12
build labs per platform
6
governance standards tracked

Standards tracked: EU AI Act (7 components), NIST AI RMF (4 functions), UNESCO AI Recommendation (6 principles), O*NET (4 competencies), AI4K12 (5 big ideas), CSTA (4 practices). Every v2 course has a COVERAGE matrix rating each component and an EVIDENCE record citing specific modules, sections, and quoted text.