◆ /pipeline · the sequence

A stack of nested loops
with explicit exit criteria.

The pipeline runs as the main Claude Code session. Each phase has a contract; each loop has an exit checklist; each gate is enforced by a hook that exit-2's until evidence is real. The README is the source of truth; this page is the visual map.

◆ Flight plan · interactive

Trace a run, phase by phase

A live signal travels the spine of the pipeline. Hover any phase to inspect its owners, the artifact it emits, and the contract that must be true before the next phase fires.

▸ LIVE_INSPECTOR

03/10

PHASE 1

PLAN

Plan & Validate

100%-coverage hard gate. 12-condition exit checklist.

▸ OWNERS

system-architectproposal-refiner

▸ EMITS

openspec proposal (valid)

▸ CONTRACT

01openspec validate --all --strict must be valid — independently re-run by the Stop hook (v3.9.2) so a skipped or mis-reported verdict cannot pass.
02Every requirement has ≥ 1 measurable scenario; every both-layer requirement has an explicit front-to-back integration criterion.
03Reuse decisions cite real files/symbols in CODEBASE_MAP; no duplicate capabilities.
04Phase 2 cannot start until all 12 conditions hold.

▸ HOVER OR CLICK A PHASE TO INSPECT

◆ Node grid

10 phases at a glance

PHASE 1

PLAN

Plan & Validate

100%-coverage hard gate. 12-condition exit checklist.

01openspec validate --all --strict must be valid — independently re-run by the Stop hook (v3.9.2) so a skipped or mis-reported verdict cannot pass.
02Every requirement has ≥ 1 measurable scenario; every both-layer requirement has an explicit front-to-back integration criterion.
03Reuse decisions cite real files/symbols in CODEBASE_MAP; no duplicate capabilities.
04Phase 2 cannot start until all 12 conditions hold.

◆ Phase contracts

Every phase, full detail

PHASE

−1

INTAKE

Intake & Mapping

Build CODEBASE_MAP, ROUTE_MAP, DESIGN_MAP, INTEGRATION_MAP.

›Cartographer + route-mapper produce per-codebase maps.
›3 codebase-map-reviewer agents argue in parallel until all return ok — no iteration cap (v3.8.0 unbounded solving).
›Integration mapping converges 3 explorers → master-synthesizer.
›Freshness short-circuit: skip when last_mapped ≥ git head.
›v3.33.0: a claude.ai/design/p/<id> link (or a `claude_design` MCP mention) is auto-detected here and the whole design project is fetched natively — no zip download.

PHASE

DETECT

Detect & Normalize

Normalize OpenSpec / Superpowers / plain markdown briefs.

›Orchestrator inspects the requirements folder and converts to one internal contract.
›Initializes coverage-map.json.
›Phase 0.1 (v2.18.0): consults the per-codebase discipline registry and auto-applies safe disciplines; routes the rest as Solution Requirements.
›Phase 0c (v3.5.0): dispatches the data-engineering-exploration pipeline when data-plane work (dbt, Airflow, Snowflake, Kafka, feature-stores) is detected.

PHASE

PLAN

Plan & Validate

100%-coverage hard gate. 12-condition exit checklist.

›openspec validate --all --strict must be valid — independently re-run by the Stop hook (v3.9.2) so a skipped or mis-reported verdict cannot pass.
›Every requirement has ≥ 1 measurable scenario; every both-layer requirement has an explicit front-to-back integration criterion.
›Reuse decisions cite real files/symbols in CODEBASE_MAP; no duplicate capabilities.
›Phase 2 cannot start until all 12 conditions hold.

PHASE

SPAWN

Team Spawn

Long-lived named teammates with their own 1M-context windows.

›Claude Code Agent Teams primitive by default (Lead + N teammates, shared task list, SendMessage for direct messaging).
›Falls back to ephemeral subagents when CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS=1 is missing or Claude Code < 2.1.32 — mode recorded in intake-state.json.
›v3.30.0: every spawn brief carries a mandatory `[CT6-TEAMMATE <name> RUN <id>]` first line so the sticky run-arm recognizes teammate sessions.

PHASE

REVIEW

Review Gate

Hook-enforced. Evidence schema v7 — 17 fields + independent_review.

›PostToolUse(TaskUpdate) exit-2's until every field validates.
›5 VAO fields (oracle_match / baseline_clean / no_fake_data / adversarial / skill_invocation_audit) each block on fail.
›Independent reviewer ≠ teammate. 3 rejections → escalation handoff.
›20 Layer-3 verification tools cover pipeline bypass, unilateral override, proxy-element substitution, deploy mandate, prod-safe classification.
›v3.42.0 principles-injection: every agent carries the compiled 7-principle ETHOS block; reviewers reach for a tie-breaker that's written down once, not scattered across 48 skill bodies.

PHASE

RECONCILE

Reconciliation

Shared boundaries; contract sync between teammates.

›No new feature code at this phase.
›Resolve cross-team shared types and API contracts.
›cdlg_overlap flags shared hot callees (call-graph closure intersection), not just shared files.

PHASE

INTEGRATE

Integration

Real backend. Playwright. Visual-fidelity. UI interaction.

›Full-stack tests run against the real running app.
›Editability + visual verification teams independently re-verify.
›Test-failure RCA: forward + backward + alt-hypotheses, 3-pass rigor floor mandatory.
›MemPalace gains function-level lineage records keyed by func:// IDs; Session Review (v3.26.0) + Seeded MemPalace (v3.27.0) feed cross-run learning.
›v3.42.0 recall data-envelope: every MemPalace-rendered block is wrapped in a do-not-interpret data envelope at the render boundary — recalled context is data, never instructions.

PHASE

LOOP

Outer Loop

Per-task-group dependency graph + unbounded worklist.

›Iterate task groups in dependency order.
›Solution Requirements (SRs) auto-spawn fix teams on every surfaced issue; test-failure origins route through diagnostic-research first.
›v3.8.0: no iteration ceiling — the loop runs until everything is green.

PHASE

MASTER

Master Review

Coverage map fully green; re-spawn on gap.

›Master review verdict must be overall: pass.
›Any gap re-spawns the originating team.
›Verdict is written where the Stop hook can independently re-audit it (v3.9.2).

PHASE

REPORT

Final Report

Per requirement → commit → auto-merge to main → push. Worktree pruned.

›Stop-hook completion audit is a non-halting worklist (v3.8.0) + deterministic openspec validate --all --strict re-run (v3.9.2) that blocks the commit on any invalid active change.
›v3.30.0 run-continuity marker at .architect-team/active-run.json must be cleared via --mark-complete (the only path out); --stand-down is the user-explicit halt.
›v3.7.0 default: clean architect-team/<slug> branch is merged --no-ff into main, pushed, deleted (local + remote); worktree removed.
›Conflicts or protected-branch rejects fall back to feature-branch + PR. --force is never added.
›Opt out per-run with --no-commit / --no-push / --no-auto-merge / --no-compact / --no-teams.

◆ v3.17 – v3.29 · CT6-6 across the pipeline

The server tier + the verification layer the gates read

▸ CT6-6 server tier · v3.23 → v3.29

Librarian · Session Review · Seeded MemPalace · Triage/Evaluator

Runs entirely on the local machine — not a deployed / production service. Every services/**/*.py is asserted import-clean stdlib + in-repo only at module load, so closed / paid pieces plug through adapter seams and each service lifts cleanly into its own repo.

›/architect-team:librarian-install — full lifecycle CLI + per-OS boot descriptor (launchd / systemd / schtasks), never auto-loaded.
›Session Review + Seeded MemPalace feed cross-run learning back into Phase 5.
›Real Anthropic adapter when ANTHROPIC_API_KEY resolves, else honest provisioned-but-disabled state.

▸ Verified Agent Output (VAO) · Layer 3

20 verification tools + a unified virtue-framed / bypass detector

Every Phase 3 self-review carries 5 VAO verdicts (oracle_match / baseline_clean / no_fake_data / adversarial / skill_invocation_audit) — any fail blocks the gate.

›pretool_unilateral_override_guard.py (v3.0.0) — the unified virtue-framed / element-of-bypass detector; catches drift at action time, not Stop time.
›pretool_skill_gate.py (v3.15.0) — skill-invocation hard gate; blocks build tools until the pipeline Skill is re-invoked after compact.
›Pipeline-bypass detection (v2.22.0), proxy-element substitution (v2.21.0), deploy-mandate contract (v2.20.0), in-flight clarification processing (v2.19.0).

▸ Phase 0c · data-engineering-exploration (v3.5.0)

The data-plane analog of visual-to-api-design

When the brief touches dbt / Airflow / Snowflake / Databricks / Kafka / feature-store work, Phase 0 dispatches a 7-stage exploration pipeline that emits an OpenSpec brief Phase 1 validates like any other — with ≥ 1 blocker-severity validation rule per transformation and end-to-end lineage + aggregate + per-endpoint metrics.

▸ Code & Data Lineage Graph (CDLG) · P0–P6

Per-endpoint call traces the diagnostic team consumes

Deterministic core in hooks/lineage_graph.py; runtime-witness reconciliation reuses code-path-witness.json as ground truth. Rename-stable func:// / asset:// IDs join MemPalace records across renames.

▸ Gateway integration · v3.38 → v3.42

Three modes. one confirm probe. a heal path for every drift.

▸ subscription

Fable via Claude sign-in; no split, no api-key. Uniform-fable ship state.

▸ provisioned

Local LiteLLM gateway on api-key auth; uniform-fable model policy. Env block written to ~/.claude/settings.json.

▸ external-llm

Split policy secondary-split — dev-class agents route through the gateway to ct6-secondary (OpenAI Codex or Z.ai GLM 5.2), spawn-compat via claude-haiku-4-5 impersonation.

Verify-then-swap deploy (v3.41.0): a staging instance passes the full ladder — bind + served-state, real-key completion with upstream identity via /model/info, auth enforcement — before the cutover. Old instance keeps serving on any staging failure. Launcher refuses to bind an already-held port.

Activation-drift heal (v3.41.1): install status names DRIFTED when recorded consent doesn't match the settings.json env block; carry-forward install re-applies from the persisted master key; SessionStart re-applies when the gateway port is live.

Honest disclosure: requests labeled claude-haiku-4-5 through this gateway are answered by the chosen secondary (e.g. GLM 5.2), not by Anthropic's Haiku. The mapping is recorded in gateway.json, printed by status, and test-pinned.

Honest boundary: live polyglot call-graph extraction against arbitrary target repos is the agent's runtime job and is not yet claimed proven end-to-end. The deterministic schema, IDs, witness reconciliation, freshness, and cost pieces are unit-tested.

◆ Logic maps

How flow is decided

Logic Map A

Phase 3 review gate

Every TaskUpdate(completed) on a teammate-owned task is gated. The hook exits 2 (block) until the 12-field evidence schema + independent_review are valid. 3 consecutive rejections → escalation handoff.

Logic Map B

Issue → fix routing

Every surfaced issue becomes a Solution Requirement. Test-failure origins route through diagnostic research first; editability + interaction gaps go straight to a fix team. The loop closes when the originating check passes.

Logic Map C

Stop-hook completion audit

Non-halting worklist (v3.8.0) + deterministic openspec validate --all --strict re-run (v3.9.2) + v3.30.0 run-continuity guard. The Stop hook blocks Phase 8 while .architect-team/active-run.json is `active` even on a momentarily-clean worklist; after CT6_MAX_NO_PROGRESS_STOPS no-progress blocks it auto-writes escalation-pending.md so wedged runs surface instead of looping.

Read full logic maps in README →

◆ Hook interceptors

Seven wired events. evidence-driven gates.

Every gate in the pipeline is enforced by a hook. No agent can mark its own work done. The Stop hook re-runs openspec validate --all --strict (v3.9.2); the SessionStart hook heals v3.41.1 activation drift and wraps every v3.42.0 MemPalace recall in a do-not-interpret data envelope — a wedged run auto-escalates instead of looping.

HOOK_01

PostToolUse(TaskUpdate)

review-gate-task.py — evidence schema v7 (17 self-review fields + independent_review block). Exit 2 blocks completion until every field validates.

EXIT 2→PASS

HOOK_02

SubagentStop

teammate-idle-check.py — re-checks review-gate evidence when a teammate goes idle; feeds the continuous Solution-Requirement intake loop.

EXIT 2→PASS

HOOK_03

Stop

pipeline-completion-audit.py — non-halting worklist (v3.8.0) + deterministic openspec validate --all --strict re-run (v3.9.2) + v3.30.0 run-continuity guard (blocks Stop while .architect-team/active-run.json is `active`; auto-writes escalation-pending.md after CT6_MAX_NO_PROGRESS_STOPS no-progress blocks).

EXIT 2→PASS

HOOK_04

PreToolUse(Edit|Write|NotebookEdit)

pretool_unilateral_override_guard.py (v3.0.0) — the unified virtue-framed/bypass detector: blocks ad-hoc edits during an active run when no Skill(architect-team-pipeline) has been invoked.

EXIT 2→PASS

HOOK_05

PreToolUse(Task|Skill|Edit|Write|Bash)

pretool_skill_gate.py — v3.15.0 skill-invocation hard gate + v3.30.0 sticky run-arm. A user session that hasn't re-invoked the Skill since its last compact boundary is blocked from build tools until it does.

EXIT 2→PASS

HOOK_06

PreCompact

precompact-closeout.py — captures worklist + evidence pointers before context compaction so a resumed session can reconstruct state.

EXIT 2→PASS

HOOK_07

SessionStart

sessionstart-run-continuity.py — v3.30.0 injects the resume-via-Skill directive on start / resume / clear / compact; v3.41.1 activation self-heal re-applies a dropped ANTHROPIC_BASE_URL / ANTHROPIC_AUTH_TOKEN env block merge-preservingly from the persisted master key when the gateway port is live; v3.42.0 wraps every injected MemPalace recall in a do-not-interpret data envelope.

EXIT 2→PASS

◆ Examples · the artifacts the gates actually read

What's on disk when a phase passes

Every gate reads a JSON file. Below: the exact shape the Phase 3 hook requires, and the Solution Requirement the orchestrator picks up to spawn a fix team.

▸ Evidence schema v7 — gates Phase 3 (17 fields + independent_review)

.architect-team/reviews/T-042.jsonjson

{
  "task_id": "T-042-add-invoice-export",
  "spec_review": "pass",
  "quality_review": "pass",
  "real_not_stubbed": true,
  "tests": { "added": 4, "passing": 4 },
  "demo_artifact": "demos/T-042-export.mp4",
  "files_changed": [
    "apps/web/routes/invoices.export.tsx",
    "apps/api/handlers/invoices/export.ts",
    "tests/e2e/invoices.export.spec.ts"
  ],
  "reuse_compliance": "ok",
  "visual_fidelity_review": "pass",
  "test_completeness_review": "pass",
  "integration_testing_review": "pass",
  "ui_interaction_review": "pass",
  "oracle_match_review": "pass",
  "baseline_clean_review": "pass",
  "no_fake_data_review": "pass",
  "adversarial_review": "pass",
  "skill_invocation_audit": "pass",
  "independent_review": {
    "reviewer": "task-reviewer",
    "verdict": "pass",
    "spec_review": "pass",
    "quality_review": "pass",
    "real_not_stubbed": true,
    "reuse_compliance": "ok",
    "reviewed_at": "2026-05-31T14:22:08Z"
  }
}

Hook: PostToolUse(TaskUpdate) → exit 2 until every field validates. VAO adds 5 verdicts (oracle_match / baseline_clean / no_fake_data / adversarial / skill_invocation_audit).

▸ Solution Requirement — fix team brief

.architect-team/solution-requirements/SR-2026-05-31-014.jsonjson

{
  "id": "SR-2026-05-31-014",
  "status": "open",
  "origin": {
    "kind": "playwright-failure",
    "test": "tests/e2e/invoices.export.spec.ts",
    "discovered_by": "interaction-reviewer",
    "discovered_at": "2026-05-31T14:18:51Z"
  },
  "summary": "Export button fires request but never resolves; spinner hangs.",
  "acceptance_criteria": [
    "Clicking Export downloads a CSV within 3s for ≤1k rows.",
    "Failure path surfaces a toast and re-enables the button.",
    "Playwright covers both success and failure flows."
  ],
  "routing": {
    "diagnostic_research_required": true,
    "fix_team": "frontend+backend"
  }
}

Test-failure origins route through diagnostic research before a fix team spawns.

◆ Phase 1 · the planning hard gate

12 conditions. all must hold. no iteration cap.

Phase 2 cannot start until every condition is satisfied. The orchestrator runs the checklist each iteration; failures route to the proposal-refiner.

01openspec validate --all --strict returns valid: true.
02Every artifact (proposal, specs, design, tasks) has status: done.
03Every source requirement has ≥ 1 scenario.
04Every requirement's acceptance criteria are measurable.
05Every front-end requirement has an explicit Playwright user-flow spec.
06Every back-end requirement has explicit dev-API integration test criteria.
07Every both-layer requirement has a front-to-back integration criterion (or recorded mock_testing_authorized opt-out).
08Every new module / file / dep in design.md has a Reuse Decision citing CODEBASE_MAP.md.
09Every Reuse Decision cites a file/symbol that actually exists.
10No duplicate capabilities (cross-checked via CODEBASE_MAP / INTEGRATION_MAP).
11Every new third-party dep has a documented comparison against the existing stack.
12tasks.md creates a new file only where existing files cannot be extended.

orchestrator console · Phase 1 iteration Nbash

$ openspec validate --all --strict --json
{ "valid": true, "errors": [] }

$ openspec status --json
{ "proposal": "done", "specs": "done", "design": "done", "tasks": "done" }

▣ Phase 1 exit checklist
  [12/12] all conditions satisfied → unlocking Phase 2 (team-spawn)

◆ UI interaction fidelity · what v0.9.19 actually catches

Three failure modes. one hook field.

✗ fake-user-flow

Fake user-flow test

Ships as: page.request.post('/api/...') — bypasses the UI entirely.

Caught by: interaction-completeness flags zero genuine page.click on a non-stub control.

✗ placeholder-page

Placeholder page

Ships as: Route wired to <ComingSoon /> while the design specifies a real screen.

Caught by: Every route enumerated and classified live / placeholder / confirmed-stub.

✗ hardcoded-dynamic-value

Hardcoded dynamic value

Ships as: Mockup's 'Welcome back, Sarah' shipped to every user.

Caught by: dynamic-value-discovery classifies from context, not from the literal.

hooks/review_evidence_schema.py · v6 fieldpython

# ui_interaction_review takes "pass" | "n/a" | "fail"
#
#   pass — every interactive element genuinely user-flow-tested,
#          every page live, every value correctly static or
#          dynamically bound, OR a confirmed-stub.
#   n/a  — slice has no UI surface. REQUIRES non-empty
#          ui_interaction_review_note.
#   fail — BLOCKED by the hook. An unwired-control / placeholder-page /
#          hardcoded-dynamic-value gap must be escalated as an SR,
#          not marked complete.

A single hook-enforced field, orthogonal to integration_testing_review.

A stack of nested loopswith explicit exit criteria.

Trace a run, phase by phase

10 phases at a glance

Every phase, full detail

The server tier + the verification layer the gates read

Librarian · Session Review · Seeded MemPalace · Triage/Evaluator

20 verification tools + a unified virtue-framed / bypass detector

The data-plane analog of visual-to-api-design

Per-endpoint call traces the diagnostic team consumes

Three modes. one confirm probe. a heal path for every drift.

How flow is decided

Phase 3 review gate

Issue → fix routing

Stop-hook completion audit

Seven wired events. evidence-driven gates.

What's on disk when a phase passes

12 conditions. all must hold. no iteration cap.

Three failure modes. one hook field.

A stack of nested loops
with explicit exit criteria.